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Abstract 



Understanding, finding, or even deciding on the existence of real solutions to a system of 
equations is a very difficult problem with many applications. While it is hopeless to expect 
much in general, we know a surprising amount about these questions for systems which 
possess additional structure. Particularly fruitful — both for information on real solutions 
and for applicability — are systems whose additional structure comes from geometry. Such 
equations from geometry for which we have information about their real solutions are the 
subject of these notes. 

We will focus on equations from toric varieties and homogeneous spaces, particularly 
Grassmannians. Not only is much known in these cases, but they encompass some of the 
most common applications. The results we discuss may be grouped into three themes: 

(I) Upper bounds on the number of real solutions. 

(II) Geometric problems that can have all solutions be real. 

(Ill) Lower bounds on the number of real solutions. 

Upper bounds as in (1) bound the complexity of the set of real solutions — they are one 
of the sources for the theory of o-minimal structures which are an important topic in real 
algebraic geometry. The existence (II) of geometric problems that can have all solutions be 
real was initially surprising, but this phenomena now appears ubiquitous. Lower bounds as 
in (III) give an existence proof for real solutions. Their most spectacular manifestation is 
the nontriviality of the Welschingcr invariant, which was computed via tropical geometry. 
One of the most surprising manifestations of this phenomenon is when the upper bound 
equals the lower bound, which is the subject of the Shapiro conjecture. 

I thank the Institut Henri Poincare, where a preliminary version of these notes were 
produced during a course I taught there in November 2005. These notes were revised and 
expanded during course at Texas A&M University in 2007 and a lecture series at the Gentre 
Interfacultaire Bernoulli at EPFL in 2008, and completed in 2010 with material from a 
lecture I gave in January 20-09 the Theorem of Mukhin, Tarasov, and Varchcnko, and 
lectures at the GAeL meeting in Leiden in June 2009. I also thank Prof. Dr. Peter Gritzmann 
of the Technische Universitat Miinchen, whose hospitality enabled the completion of the 
first version of these notes. During this period, my research was supported by NSF grants 
DMS-0701059 and GAREER grant DMS-0538734. The point of view in these notes was 
developed through the encouragement of Bernd Sturmfels, Askold Khovanskii, and Marie- 
Frangoisc Roy, and through my interactions with the many whose work is mentioned were, 
including my collaborators from whom I have learned a great deal. 

Frank Sottilc 

03.01.10, GoUege Station, Texas. 



Chapter 1: 
Overview 

In mathematics and its applications, we are often faced with a system of polynomial equa- 
tions, and we need to study or find the solutions. Such systems that arise naturally typically 
possess some geometric or combinatorial structure that may be exploited to study their so- 
lutions. Such structured systems are studied in enumerative algebraic geometry, which has 
given us the deep and powerful tools of intersection theory [39] to count and analyze their 
complex solutions. A companion to this theoretical work are algorithms, both symbolic 
(based on Grobner bases [22l I122j ) and numerical (based on homotopy continuation |104] ) 
for solving and analyzing systems of polynomial equations. 

Despite these successes, this line of research largely sidesteps the often primary goal of 
formulating problems as solutions to systems of equations — namely to determine or study 
their real solutions. This deficiency is particularly acute in applications, from control [T7] . 
Kinematics [I5], statistics |HZ], and computational biology [85], for it is typically the real 
solutions that applications seek. One reason that traditional algebraic geometry ignores the 
real solutions is that there are few elegant theorems or general results available to study real 
solutions. Nevertheless, the demonstrated importance of understanding the real solutions 
to systems of equations demands our attention. 

In the 19th century and earlier, many elegant and powerful methods were developed to 
study the real roots of univariate polynomials (Sturm sequences, Budan-Fourier Theorem, 
Routh-Hurwitz criterion), which are now standard tools in some applications of mathematics. 
In contrast, it has only been in the pat few decades that serious attention has been paid 
toward understanding the real solutions to systems of polynomial equations. 

This work has concentrated on systems possessing some, particularly geometric, struc- 
ture. The reason for this is two-fold: systems from nature typically possess some special 
structure that should be exploited in their study, and it is unlikely that any results of sub- 
stance hold for general or unstructured systems. In this period, a story has emerged of 
bounds (both upper and lower) on the number of real solutions to certain classes of sys- 
tems, as well as the discovery and study of systems that have only real solutions. This 
Overview will sketch this emerging landscape and the subsequent chapters will treat these 
developments in more detail. 

Introduction 

Our goal will be to say something meaningful about the real solutions to a system of equa- 
tions. For example, consider a system 

/l(Xi, . . . ,Xn) = f2{Xi, . . . ,Xn) = ■■■ = /at (xi , . . . , X„) = 0, (1.1) 
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of N real polynomials in n variables. Let r be its number of real solutions and d its number 
of complex solutions^. Since every real number is complex, and since nonreal solutions come 
in conjugate pairs, we have the following trivial inequality 

d > r > d mod 2 e {0,1}. 

We can say nothing more unless the equations have some structure, and a particularly 
fruitful class of structures are those which come from geometry. The main point of this 
book is that we can identify structures in equations that will allow us to do better than this 
trivial inequality. 

Our discussion will have three themes: 

(I) Sometimes, the upper bound d is not sharp and there is a smaller bound for r. 

(II) For many problems from enumerative geometry, the upper bound is sharp. 

(Ill) The lower bound for r may be significantly larger than d mod 2. 

A lot of time will be spent on the Shapiro Conjecture (Theorem of Mukhin, Tarasov, and 
Varchenko [H]) and its generalizations, which is a situation where the upper bound of d is 
also the lower bound — all solutions to our system are real. 

We will not describe how to actually find the solutions to a system (11. ip and there will 
be no discussion of algorithms nor any complexity analysis. The book of Basu, Pollack, and 
Roy [2] is an excellent place to learn about algorithms for computing real algebraic varieties 
and finding real solutions. We remark that some of the techniques we employ to study real 
solutions underlie numerical algorithms to compute the solutions. 

One class of systems that we will study are systems of sparse polynomials. Integer vectors 
a = (oi, . . . , a„) G are exponents for (Laurent) monomials 

X . — X *^ 2 * * * •X Yi ^ ]^ J . . . ^ X^i^ X ^ . . . ^ n ] * 

Sometimes, we will just call elements of Z" monomials. Let .4 C Z" be a finite set of 
monomials. A linear combination 

of monomials from ^ is a sparse polynomial with support A. Sparse polynomials naturally 
define functions on the complex torus (C^)". A system (II. ip of = n polynomials in n 
variables, where each polynomial has support A, will be called a system (of polynomials) 
with support A. These are often called unmixed systems in contrast to mixed systems 
where each polynomial may have different support. While sparse systems occur naturally — 
multilinear or multihomogeneous polynomials are an example — they also occur in problem 

^'We shall always assume that our systems are generic in the sense that all of their solutions occur with 
multiplicity 1, and the number d of complex solutions is stable under certain allowed perturbations of the 
coefficients. 
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formulations for the simple reason that we humans seek simple formulations of problems, 
and this may mean polynomials with few terms. 

A fundamental result about unmixed systems is the Kushnirenko bound on their number 
of complex solutions. The Newton polytope of a polynomial with support A is the convex 
hull of the set A of monomials. Write vol(A) for the Euclidean volume of a polytope A. 

Theorem 1.1 (Kushnirenko [6]) ^4 system of n polynomials in n variables with common 
support A has at most n!vol(A^) isolated solutions in (C^)*^, and exactly this number when 
the polynomials are generic polynomials with support A. 

Bernstein generalized this to mixed systems. The Minkowski sum P + Q oi two polytopes 
in M" is their pointwise sum as sets of vectors in M". Let Pi, . . . , P„ C M" be polytopes. 
The volume 

V0l(tiPi + ^2^2 + ■■■ + tnPn) 

is a homogeneous polynomial of degree n in the variables ti, . . . ,tn |171 Exercise 15.2.6]. The 
mixed volume MV(Pi, . . . , P„) of Pi, . . . , P„ is the coefficient of the monomial tit2 ■ ■ - tn in 
this polynomial. 

Theorem 1.2 (Bernstein [7]) ^ system of n polynomials in n variables where the polyno- 
mials have supports Ai^ . . . , An has at most MV(A_4j, . . . , A_4„) isolated solutions in (C^)", 
and exactly this number when the polynomials are generic for their given support. 

Since MV(Pi, . . . , P„) = n!vol(P) when Pi = ■ ■ ■ = P„ = P, this generalizes Kush- 
nirenko's Theorem. We will prove Kushnirenko's Theorem in Chapter |3l but will not present 
a proof of Bernstein's Theorem. Instead, we suggest two excellent sources by Sturmfels. Both 
are similar, but the first is self-contained and superbly written. 

• |123] Polynomial equations and convex polytopes, Amei. Math. Monthly 105 (1998), 
no. 10, 907-922. 

• |124] Chapter 3 in Solving systems of polynomial equations, CBMS, vol. 97, American 
Mathematical Society, Providence, RI, 2002. 

The bound of Theorem 11.11 and its generalization Theorem 11.21 is often called the BKK 
bound for Bernstein, Khovanskii, and Kushnirenko [6]. 

1.1 Upper bounds 

While the number of complex roots of a univariate polynomial is typically equal to its degree, 
the number of real roots depends upon the length of the expression for the polynomial. 
Indeed, by Descartes's rule of signs [21] (see Section |2TT]) . a univariate polynomial with m+1 
terms has at most m positive roots, and thus at most 2m nonzero real roots. For example, 
the polynomial x'^ — a with a 7^ has 0, 1, or 2 real roots, but always has d complex 
roots. Khovanskii generalized this type of a bound to multivariate polynomials with his 
fundamental fcwnomial bound. 
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Theorem 1.3 (Khovanskii [65]) A system of n polynomials in n variables having a total 
of l+n+1 distinct monomials has at most 

2m(^ + iy+- 

nondegenerate positive real solutions. 

There are two reasons for this restriction to positive solutions. Most fundament aUy is 
that Khovanskii's proof requires this restriction. This restriction also excludes the following 
type of trivial zeroes: Under the substitution i-^ x], each positive solution becomes 2"' 
real solutions, one in each of the 2^ orthants. More subtle substitutions lead to similar extra 
trivial zeroes which differ from the positive solutions only by some sign patterns. 

This is the first of many results which verified the principle of Bernstein and Kushnirenko 
that the topological complexity of a set defined by real polynomials should depend on the 
number of terms in the polynomials and not on their degree. Khovanskii's work was also a 
motivation for the notion of o-minimal structures [1281 [86] . The main point of Khovanskii's 
theorem is the existence of such a bound and not the actual bound itself. For each l,n > 1, 
we define the Khovanskii number X{l,n) to be the maximum number of nondegenerate 
positive solutions to a system of n polynomials in n variables with l + n + 1 monomials. 
Khovanskii's Theorem gives a bound for X{1, n), but that bound is enormous. For example, 
when I = n = 2, the bound is 5184. Because of this, it was expected to be far from sharp. 
Despite this expectation, the first nontrivial improvement was only given in 2003. 

Theorem 1.4 (Li, Rojas, and Wang | I74] ) Two trinomials in two variables have at most 
5 nondegenerate positive real solutions. 

This bound sharp. Haas [H] had shown that the system of two trinomials in x and y 

lOx^^^ + lly^^ - lly = 10?/^°^ + llx^^ - llx = 0, (1.2) 

has 5 positive solutions. 

Since we may multiply one of the trinomials in (11. 2p by an arbitrary monomial with- 
out changing the solutions, we can assume that the two trinomials (11. 2p share a common 
monomial, and so there are at most 3 + 3 — 1 = 5 = 2 + 2 + 1 monomials between the 
two trinomials, and so two trinomials give a fewnomial system with I = n = 2. While 5 
is less than 5184, Theorem 11.41 does not quite show that X(2,2) = 5 as two trinomials do 
not constitute a general fewnomial system with I = n = 2. Nevertheless, Theorem 11.41 gave 
strong evidence that Khovanskii's bound may be improved. Such an improved bound was 
given in [13]. 

Theorem 1.5 X{l,n) < ^2(2)71'. 

For small values of /, it is not hard to improve this. For example, when / = 0, the 
support A of the system is a simplex, and there will be at most 1 positive real solution, so 
X{0,n) = 1. Theorem 11.51 was inspired by the sharp bound of Theorem 11.61 when / = 1 [9]. 
A set A of exponents is primitive if A affinely spans the full integer lattice Z". 
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Theorem 1.6 If I = 1 and the set A of exponents is primitive, then there can he at most 
2n+l nondegenerate nonzero real solutions, and this is sharp in that for any n there exist 
systems with n+2 monomials and 2n+l nondegenerate real solutions whose exponent vectors 
affinely span Z". 

Observe that this bound is for aU real solutions, not just positive solutions. We will 
discuss this in Section Further analysis gives the sharp bound for X{l,n). 

Theorem 1.7 (Bihan [TT]) n) = n + l. 

In contrast to these results establishing absolute upper bounds for the number of real 
solutions which improve the trivial bound of the number d of complex roots, there are a 
surprising number of problems that come from geometry for which all solutions can be real. 
For example, Sturmfels jl20] proved the following. 

Theorem 1.8 Suppose that a lattice polytope A C Z" admits a regular triangulation with 
each simplex having minimal volume ^. Then there is a system of sparse polynomials with 
support A n having all solutions real. 

For many problems from enumerative geometry, it is similarly possible that all solutions 
can be real. This will be discussed in Chapter [71 The state of affairs in 2001 was presented 
in [112] . 

1.2 The Wronski map and the Shapiro Conjecture 

The Wronskian of univariate polynomials fi(t), /2(t), . . . , fm{t) is the determinant 

Wr(A,...,/J := det((|rV.(t)),_^.^,,...,^ . 

When the polynomials fi have degree m+p—1 and are linearly independent, the Wronskian 
has degree at most mp. For example, if m = 2, then W{f, g) = f'g — fg', which has degree 2p 
as the coefficients of t^^^^ in this expression cancel. Up to a scalar, the Wronskian depends 
only upon the linear span of the polynomials /i, /2, . . . , fm- Removing these ambiguities 
gives the Wronski map, 

Wr := Gr(m,C™+,„i[t]) P'"^ (1.3) 

where Gr(m, Cm+p-i[t]) is the Grassmannian of m-dimensional subspaces of the linear space 
Cm+p-i[t] of polynomials of degree m+p—1 in the variable t, and P*"^ is the projective space 
of polynomials of degree at most mp, which has dimension equal to the the dimension of 
the Grassmannian. 

Work of Schubert in 1886 [SH], combined with a result of Eisenbud and Harris in 1983 [30] 
shows that the Wronski map is surjective and the general polynomial $ G P"*^ has 

#^ := l!2!---(m-l)!-(mp)! 
"^'^ m!(m+l)! ■ • ■ (m+p—1)! 
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preimages under the Wronski map. These results concern the complex Grassmannian and 
complex projective space. 

Boris Shapiro and Michael Shapiro made a conjecture in 1993/4 about the Wronski map 
from the real Grassmannian to real projective space. This was proven when min(A;, d+l—k) = 
2 by Eremenko and Gabrielov [31], and finally settled by Mukhin, Tarasov, and Varchenko |81j . 
They have subsequently found another proof [82] . 

Theorem 1.9 If the polynomial $ G P™^ has only real zeroes, then every point in Wr~-^($) 
is real. Moreover, if^ has mp simple real zeroes then there are ^m.p real points in Wr~-'^($). 

We will sketch the proof of Mukhin, Tarasov, and Varchenko in Chapter [101 This Shapiro 
Conjecture has appealing geometric interpretations, enjoys links to several areas of mathe- 
matics, and has many theoretically satisfying generalizations which we will discuss in Chap- 
ters [HI [H and [11] We now mention two of its interpretations. 

1.2.1 The problem of four lines 

A geometric interpretation of the Wronski map and the Shapiro Conjecture when m = p = 2 
is a variant of the classical problem of the lines in space which meet four given lines. Points 
in Gr(2, C^lt]) correspond to lines in as follows. The moment curve 7 in is the curve 
with parameterization 

7(t) := {t,t\t'). 

A cubic polynomial / is the composition of 7 and an affine-linear map C'^ C, and so 
a two-dimensional space of cubic polynomials is to a two-dimensional space of affine-linear 
maps whose common kernel is the corresponding line in C^. (This description is not exact, 
as some points in Gr(2,C3[t]) correspond to lines at infinity.) 

Given a polynomial of degree 4 with distinct real roots, points in the fiber Wr ^^($) 
correspond to the lines in space which meet the four lines tangent to the moment curve 7 
at its points coming from the roots of There will be two such lines, and the Shapiro 
conjecture asserts that both will be real. 

It is not hard to see this directly. Any fractional linear change of parameterization of 
the moment curve is realized by a projective linear transformation of 3-dimensional space 
which stabilizes the image of the moment curve. Thus we may assume that the polynomial 
is equal to (t^ — t){t — s), which has roots —1, 0, 1, and s, where s G (0, 1). Applying 
an affine transformation to 3-dimensional space, the moment curve becomes the curve with 
parameterization 

7 : t ^ (et^-i, It^ + ^t, 

Then the lines tangent to 7 at the roots —1, 0, 1 of ^ have parameterizations 
(-5 - s, 5 + s, -1) , (-1, s, s) , {5 + s ,5 + s ,1) s G M. 
These lie on a hyperboloid Q of one sheet, which is defined by 

l-xf + xj-xl = 0. (1.5) 
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Figure 1.1: Quadric containing three lines tangent to 7. 



We display this geometric configuration in Figure ll.li There, is the line tangent to 7 
at the point 7(«). The quadric Q has two rulings. One ruling contains our three tangent 
lines and the other ruling (which is drawn on Q) consists of the lines which meet our three 
tangent lines. 

Now consider the fourth line £{s) which is tangent to 7 at the point 7(s). This has the 
parameterization 

We compute the intersection of the fourth line with Q. Substituting its parameterization 
into (11. 5p and dividing by —12 gives the equation 

{s^ - s){s^ - s + t{6s'^ -2) + 9st^) = 0. 

The first (nonconstant) factor — s vanishes when (!{s) is equal to one of ('{ — I), ^(—0), or 
£(— l)-for these values of s every point of ('{s) lies on the quadric Q. The second factor has 
solutions 

_ - 1 ± V3s2 I 
9s ■ 

Since + 1 > for all s, both solutions will be real. 

We may also see this geometrically. Consider the fourth line f(s) for < s < 1. In 
Figure II. 2[ we look down the throat of the hyperboloid at the interesting part of this con- 
figuration. This picture demonstrates that i{s) must meet Q in two real points. Through 
each point, there is a real line in the second ruling which meets all four tangent lines, and 
this proves Shapiro's conjecture for m = p = 2. 

1.2.2 Rational functions with real critical points 

When m = 2, the Shapiro conjecture may be interpreted in terms of rational functions. 
A rational function p{t) = f{t)/g{t) is a quotient of two univariate polynomials, / and g. 
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7(s) 

Figure 1.2: The fourth tangent Une meets Q in two real points. 

This defines a map p: — )■ whose critical points are those t for which p'{t) = 0. Since 
P\t) = {f'g — g'f)/g'^i we see that the critical points are the roots of the Wronskian of / 
and g. Composing the rational function p: P^ — t- with an automorphism of the target 
P^ gives an equivalent rational function, and the equivalence class of p is determined by 
the linear span of its numerator and denominator. Thus Shapiro's conjecture asserts that a 
rational function having only real critical points is equivalent to a real rational function. 

Eremenko and Gabrielov [M] proved exactly this statement in 2002, and thereby estab- 
lished the Shapiro Conjecture in the case m = 2. 

Theorem 1.10 A rational function with only real critical points is equivalent to a real 
rational function. 

In Chapter [2] we will present an elementary proof of this result that Eremenko and 
Gabrielov found in 2005 |32j . 

1.3 Lower bounds 

We begin with some of perhaps the most exciting recent development in real algebraic 
geometry. It begins with the fundamental observation of Euclid that two points determine a 
line. Many people who have studied geometry know that five points on the plane determine 
a conic. In general, if you have m random points in the plane and you want to pass a 
rational curve of degree d through all of them, there may be no solution to this interpolation 
problem (if m is too big), or an infinite number of solutions (if m is too small), or a finite 
number of solutions (if m is just right). It turns out that "m just right" means m = 3d—l 
(m = 2 for lines and m = 5 for conies). 

A harder question is, if m = 3d—l, how many rational curves of degree d interpolate 
the points? Let's call this number A^^^, so that TVi = 1 and A''2 = 1 because the line and 
conic of the previous paragraph are unique. It has long been known that = 12, and in 
1873 Zeuthen |132] showed that iV4 = 620. That was where matters stood until 1989, when 
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Ran [89] gave a recursion for these numbers. In the 1990's, Kontsevich and Manin [69] used 
associativity in quantum cohomology of to give the elegant recursion 

(1.6) 

a+b=d ^ ^ ^ ^ ^ ^ 

which begins with the Euchdean declaration that two points determine a line (A^^i = 1). 
These numbers grow quite fast, for example iVs = 87304. 

The number of real rational curves which interpolate a given Sc?— 1 points in the real plane 
MP^ will depend rather subtly on the configuration of the points. To say anything about the 
real rational curves would seem impossible. However this is exactly what Welschinger [131] 
did. He found an invariant which does not depend upon the choice of points. 

A rational curve in the plane is necessarily singular — typically it has i^~2^ nodes. Real 
curves have three types of nodes. Only two types are visible in MP^, and we are familiar with 
them from rational cubics. The curve on the left below has a node with two real branches, 
and the curve on the right has a solitary point where two complex conjugate branches 
meet. 



The third type of node is a pair of complex conjugate nodes, which are not visible in MP^. 
Theorem 1.11 (Welschinger [131] ) The sum, 

^ ^^#{sohtary points, •,in C} 

over all real rational curves C of degree d interpolating ?)d— 1 general points in MP^ does not 
depend upon the choice of the points. 

Set Wd to be the sum (11.71) . The absolute value of this Welschinger invariant is then a 
lower bound for the number of real rational curves of degree d interpolating ?)d — 1 points 
in MP^. Since Ni = N2 = 1, we have Wi = W2 = 1. Prior to Welschinger's discovery, 
Kharlamov [231 Proposition 4.7.3] (see also Section [7.2. 2p showed that W3 = 8. The question 
remained whether any other Welschinger invariants were nontrivial. This was settled in the 
affirmative by Itenberg, Kharlamov, and Shustin [SH EZ], who used Mikhalkin's Tropical 
Correspondence Theorem [79] to show 



1. If > 0, then Wd> J. (Hence Wd is positive.) 

2. lim ^^llf± = 1. (In fact, log Nd ~ Sdhgd ~ logA^^.) 

d^oo log Wd 
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In particular, there are always quite a few real rational curves of degree d interpolating 3d—l 
points in MP^. Since then, Itenberg, Kharlamov, and Shustin [58] gave a recursive formula 
for the Welschinger invariant which is based upon Gathmann and Markwig's |15] tropical- 
ization of the Caporaso-Harris [18] formula. This shows that W4 = 240 and PV5 = 18264. 
Solomon |103j has also found an intersection-theoretic interpretation for these invariants. 

These ideas have also found an application. Gahleitner, Jiittler, and Schicho [H] pro- 
posed a method to compute an approximate parametrization of a plane curve using rational 
cubics. Later, Fiedler-Le Touze [36] used the result of Kharlamov (that W3 = 8), and an 
analysis of pencils of plane cubics to prove that this method works. 

While the story of this interpolation problem is fairly well-known, it was not the first 
instance of lower bounds in enumerative real algebraic geometry. In their investigation of 
the Shapiro conjecture, Eremenko and Gabrielov found a similar invariant am,p which gives 
a lower bound on the number of real points in the inverse image Wr ~^($) under the Wronski 
map of a real polynomial $ G M¥"^'p^ . Assume that m < p. If m + p is odd, set 

1!2! ■ ■ ■ (m-l)!(p-l)!(p-2)! ■ ■ ■ (p-m+l)!(^f )! 
(p-m+2)!(p-m+4)! • • ■ (p+m-2)! (^^^)! ■ ■ ■ (2±f^)! ' 

If m + p is even, then set am,p = 0. li m > p, then set am,p '■= crp,m- 

Theorem 1.12 (Eremenko-Gabrielov [,33j) // G MP™^ is a generic real polynomial 
of degree mp (a regular value of the Wronski map), then there are at least (Jm,p real Tri- 
dimensional subspaces of polynomials of degree m+p—1 with Wronskian $. 

Remark 1.13 Recall that the number of complex points in Wr"^($) is #m,p f!1.4p . It is 
instructive to compare these numbers. We show them for m+p = 11 and m = 2, . . . , 5. 



m 


2 


3 


4 


5 


^m;p 


14 


110 


286 


286 


^m,p 


4862 


23371634 


13672405890 


396499770810 



We also have ~ 3.4 x 10^ and ij^T^ ~ 9.5 x 10^®. Despite this disparity in their 
magnitudes, the asymptotic ratio of log((T.m,p)/ log(#m,p) appears to be close to 1/2. We 
display this ratio in the table below, for different values of m and p. 



log 


(c"m,p) 


m 


l0g(#m,p) 


2 


m+p— 1 
10 


Q m+p—1 
^ 10 


Q m+p—1 
10 


i m,+p- 1 
^ 10 


p; m+p-1 
^ 10 




100 


0.47388 


0.45419 


0.43414 


0.41585 


0.39920 


0.38840 


1 — 1 


1000 


0.49627 


0.47677 


0.46358 


0.45185 


0.44144 


0.43510 


1 


10000 


0.49951 


0.48468 


0.47510 


0.46660 


0.45909 


0.45459 


+ 


100000 


0.49994 


0.48860 


0.48111 


0.47445 


0.46860 


0.46511 


1000000 


0.49999 


0.49092 


0.48479 


0.47932 


0.47453 


0.47168 




10000000 


0.50000 


0.49246 


0.48726 


0.48263 


0.47857 


0.47616 



Thus, the lower bound on the number of real points in a fiber of the Wronski map appears 
asymptotic to the square root of the number of complex solutions. 
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It is interesting to compare this to the the result of Shub and Smale |100j that the 
expected number of real solutions to a system of n Gaussian random polynomials in n 
variables of degrees c^i , . . . , is y/di - ■ -dn, which is the square root of the number of complex 
solutions to such a system of polynomials. 

The idea behind the proof of Theorem ll.l2l is to compute the topological degree of the real 
Wronski map, which is the restriction of the Wronski map to real subspaces of polynomials, 

WrK := Wr|Gr(m,M„+,_i[t]) : Gr(m, R„+p_i[t]) MP'"^ 

This maps the Grassmannian of real subspaces to the space of real Wronski polynomials. 
Recall that the topological degree of a map f : X —> Y between two oriented manifolds 
X and Y of the same dimension is the number d such that = d[Y], where [X] and 

[Y] are the fundamental homology cycles of X and Y, respectively, and is the functorial 
map in homology. When / is different iable, this may be computed as follows. Let y & Y 
be a regular value of / so that at any point x in the fiber f~^{y) above y the derivative 
dxf'- T^X — > TyY is an isomorphism. Since X and Y are oriented, the isomorphism d^f 
either preserves the orientation or it reverses it. Let P be the number of points x G f~^{y) 
at which d^f preserves the orientation and R be the number of points where the orientation 
is reversed. Then the degree of / is the difference P — R. 

There is a slight problem in computing the degree of Wr^, as neither the real Grassman- 
nian nor the real projective space is orientable when m+p is odd, and thus the topological 
degree of Wr^ is not defined when m+p is odd. Eremenko and Gabrielov get around this by 
computing the degree of the restriction of the Wronski map to open cells of Gr^ and MP'^^, 
which is a proper map. They also show that it is the degree of a lift of the Wronski map to 
oriented double covers of both spaces. The degree bears a resemblance to the Welschinger 
invariant as it has the form | ^ ±1|, the sum over all real points in Wr ]^^($), for $ a regular 
value of the Wronski map. This resemblance is no accident. Solomon |103] showed how to 
orient a moduli space of rational curves with marked points so that the Welschinger invariant 
is indeed the degree of a map. 

While both of these examples of geometric problems possessing a lower bound on their 
numbers of real solutions are quite interesting, they are rather special. The existence of 
lower bounds for more general problems or for more general systems of polynomials would 
be quite important in applications, as these lower bounds guarantee the existence of real 
solutions. 

With Soprunova, we |105] set out to develop a theory of lower bounds for sparse polyno- 
mial systems, using the approach of Eremenko and Gabrielov via topological degree. This is 
a first step toward practical apphcations of these ideas. Chapter IH] will elaborate this theory. 
Here is an outline: 

(z) Realize the solutions to a system of polynomials as the fibers of a map from a toric 
variety. 

(m) Give a condition which implies that the degree of this map (or a lift to double covers) 
exists. 
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[iii) Develop a method to compute the degree in some (admittedly special) cases. 
{iv) Give a nice family of examples to which this theory applies. 

{v) Use the sagbi degeneration of a Grassmannian to a toric variety [122i Ch. 11] and the 
systems of (iv) to recover the result of Eremenko and Gabrielov. 

Example 1.14 We close this Chapter with one example from this theory. Let w,x,y,z be 
indeterminates, and consider a sparse polynomial of the form 

C4 wxyz 
+ csi^wxz + xyz) 
+ C2{wx + xz + yz) (1.8) 
+ ci(x + z) 

+ Co, 

where the coefficients cq, . . . , C4 are real numbers. 



Theorem 1.15 A system of four equations involving polynomials of the form (11. 8p has six 
solutions, at least two of which are real. 

We make some remarks to illustrate the ingredients of this theory. First, the monomials 
in the sparse system (11.81) are the integer points in the order polytope of the poset P, 



X • 



W9 



That is, each monomial corresponds to an order ideal of P (a subset which is closed upwards). 
The number of complex roots is the number of linear extensions of the poset P. There are 
six, as each is a permutation of the word wxyz where w precedes x and y precedes z. 

One result (ii) gives conditions on the Newton polytope which imply that any polynomial 
system with that Newton polytope has a lower bound, and order polytopes satisfy these 
conditions. Another result (iv) computes that lower bound for certain families of polynomials 
with support an order polytope. Polynomials in these families have the form (II. 8p in that 
monomials with the same total degree have the same coefficient. For such polynomials, the 
lower bound is the absolute value of the sum of the signs of the permutations underlying 
the linear extensions. We list these for P. 



permutation 


wxyz 


wyxz 


ywxz 


wyzx 


ywzx 


yzwx 


sum 


sign 


+ 




+ 


+ 




+ 


2 



This shows that the lower bound in Theorem 11.151 is 2. 
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We record the frequency of the different root counts in each of 10,000,000 instances of 
this polynomial system, where the coefficients were chosen uniformly from [—200,200]. 



number of real roots 





2 


4 


6 


frequency 





9519429 





480571 



This computation took 19,854 CPU-seconds, or 5:30:54 on a AMD Athlon 64-bit dual core 
processor 4600+ (2.4 GHz, 512KB cache), using a single core. 

The apparent gap in the numbers of real solutions (4 does not seem a possible number of 
real solutions) is proven for the system of Example 11.141 This is the first instance we have 
seen of this phenomena of gaps in the numbers of real solutions. More are found in [105j 
and [Si]. 
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Chapter 2: 

Real solutions to univariate 
polynomials 

Before we study the real solutions to systems of multivariate polynomials, we will review 
some of what is known for univariate polynomials. The strength and precision of results 
concerning real solutions to univariate polynomials forms the gold standard in this subject 
of real roots to systems of polynomials. We will discuss two results about univariate poly- 
nomials: Descartes' rule of signs and Sturm's Theorem. Descartes' rule of signs, or rather 
its generalization in the Budan-Fourier Theorem, gives a bound for the number of roots in 
an interval, counted with multiplicity. Sturm's theorem is topological — it simply counts the 
number of roots of a univariate polynomial in an interval without multiplicity. From Sturm's 
Theorem we obtain a symbolic algorithm to count the number of real solutions to a system 
of multivariate polynomials in many cases. We underscore the topological nature of Sturm's 
Theorem by presenting a new and very elementary proof due to Burda and Khovanskii [63j . 
These and other fundamental results about real roots of univariate polynomials were estab- 
lished in the 19th century. In contrast, the main results about real solutions to multivariate 
polynomials were only established in recent decades. 

2.1 Descartes' rule of signs 

Descartes' rule of signs [23] is fundamental for real algebraic geometry. Suppose that / is a 
univariate polynomial and write its terms in increasing order of their exponents, 

/ = cox"« + cix'^i + ■ ■ ■ + c^x'^™ , (2.1) 

where Q 7^ and Oq < cti < ■ ■ ■ < cim- 

Theorem 2.1 (Descartes' rule of signs) The number, r, of positive roots of f , counted 
with multiplicity, is at most the variation in sign of the coefficients of f , 

\ 1 < i < m and Ci^iCi < 0} < r , 

and the difference between the variation and r is even. 

We will prove a generalization, the Budan-Fourier Theorem, which provides a similar 
estimate for any interval in M. We first formalize this notion of variation in sign that 
appears in Descartes' rule. 

The variation var(c) in a finite sequence c of real numbers is the number of times that 
consecutive elements of the sequence have opposite signs, after we remove any Os in the 
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sequence. For example, the first sequence below has four variations, while the second has 
three. 

8, -4, -2, -1, 2, 3, -5, 7, 11, 12 - 1, 0, 1, 0, 1, -1, 1, 1, 0, 1 . 

Suppose that we have a sequence F = (/o, /i, . . . , fk) of polynomials and a real number 
a G M. Then var(F, a) is the variation in the sequence /o(a), • • • , This notion 

also makes sense when a = ±oo: We set var(F, oo) to be the variation in the sequence of 
leading coefficients of the fi{t), which are the signs of /j(a) for a ^ 0, and set var(F, —oo) 
to be the variation in the leading coefficients of fi{—t). 

Given a univariate polynomial f{t) of degree k, let 6f be the sequence of its derivatives, 

Sf ■■= {f{t)J'{t)J"{t),...J^'\t)). 

For a, 6 G M U {±oo}, let r(/, a, h) be the number of roots of / in the interval (a, 6], counted 
with multiplicity. We prove a version of Descartes' rule due to Budan [16] and Fourier [37] . 

Theorem 2.2 (Budan-Fourier) Let f G M[t] be a univariate polynomial and a < b two 
numbers m R U {±00} . Then 

var((5/,a) - var(5/,6) > r{f,a,b), 

and the difference is even. 

We may deduce Descartes' rule of signs from the Budan-Fourier Theorem once we observe 
that for the polynomial f(t) (12. ip . var(5/, 0) = var(co, ci, . . . , Cm), while var((5/, 00) = 0, as 
the leading coefficients of 6f all have the same sign. 

Example 2.3 The the sextic / = 5t^ — — 27t^ + 55t^ — 6 whose graph is displayed below 




has four real zeroes at approximately —0.339311, 0.340401, 1.59753, 2.25615. If we evaluate 
the derivatives of / at we obtain 

Sf(0) = -6, 0, 110, 0, -648, -480, 3600, 
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which has 3 variations in sign. If we evaluate the derivatives of / at 2, we obtain 

5/(2) = -26, -4, 574, 2544, 5592, 6720, 3600, 

which has one sign variation. Thus, by the Budan-Fourier Theorem, / has either 2 or 
roots in the interval (0, 2), counted with multiplicity. This agrees with our observation that 
/ has 2 roots in the interval [0, 2]. 

Proof of Budan-Fourier Theorem. Observe that var((5/, t) can only change when t passes a 
root c of some polynomial in the sequence 6f of derivatives of /. Suppose that c is a root 
of some derivative of / and let e > be a positive number such that no derivative f^^^ has 
a root in the interval [c — e, c + e], except possibly at c. Let m be the order of vanishing of 
/ at c. We will prove that 

(1) var((5/, c) = var(5/, c + e), and ^ ^ 

(2) var((5/, c — e) > vai{6f, c) + m, and the difference is even. 

We deduce the Budan-Fourier theorem from these conditions. As t ranges from a to b, 
both r{f,a,t) and var((5/, t) only change when t passes a root c of /. (These could change 
at a root of a derivative of /, but in fact do not.) At such a point, r{f,a,t) jumps by the 
multiplicity m of that root of /, while var(5/, t) drops by m, plus a nonnegative even integer. 
Thus the sum r(/, a, t) + var(5/, t) can only change at roots c of /, where it drops by an even 
integer. Since this sum equals var((5/, a) when t = a, the Budan-Fourier Theorem follows. 

Let us now prove our claim about the behavior of var(5/, t) in a neighborhood of a root 
c of some derivative /(*^ We argue by induction on the degree of /. When / has degree 1, 
then we are in one of the following two cases, depending upon the sign of /' 




In both cases, vai{Sf, c — e) = 1, but var((5/, c) = var(5/, c + e) =0, which proves the claim 
when / is linear. 

Now suppose that the degree of / is greater than 1 and let m be the order of vanishing 
of / at c. We first treat the case when /(c) = 0, and hence m > so that /' vanishes at c 
to order m— 1. We apply our induction hypothesis to /' and obtain that 

var((5/',c) = var((5/', c + e), and var(5/', c — e) > var(5/',c) + (m — 1) , 

and the difference is even. By Lagrange's Mean Value Theorem applied to the intervals 
[c — e, c] and [c, c -|- e] , / and /' must have opposite signs at c — e, but the same signs at c + e, 
and so 

var(5/, c) = var((5/', c) = var(5/', c + e) = var(5/, c + e) , 
var(5/, c — e) = var((5/', c — e) + 1 > var(5/', c) + (m — 1) + 1 = var((5/, c) + m , 
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and the difference is even. This proves the claim when /(c) =0. 

Now suppose that /(c) 7^ so that m = 0. Let n be the order of vanishing of /' at c. 
We apply our induction hypothesis to /' to obtain that 

var(5/', c) = var(5/', c + e), and var((5/', c — e) > var((5/',c) + n, 

and the difference is even. We have /(c) 7^ 0, but /'(c) = ■ ■ ■ = f'^'^\c) = 0, and f'^'^^^\c) 7^ 
0. Multiplying / by —1 if necessary, we may assume that /("+^)(c) > 0. There are four 
cases: n even or odd, and /(c) positive or negative. We consider each case separately. 

Suppose that n is even. Then both /'(c — e) and /'(c + e) are positive and so for each 
t E {c — e, c, c + e} the first nonzero term in the sequence 

/'(t), /"(t), /W(t) (2.3) 

is positive. When /(c) is positive, this imphes that var((5/, t) = var((5/',t) and when /(c) is 
negative, that var(5/, t) = var((5/', t) + l. This proves the claim as it implies that var((5/, c) = 
var(5/, c + e) and also that 

var(5/, c — e) — var(5/, c) = var(5/', c — e) — var(5/', c) , 

but this last difference exceeds n by an even number, and so is even as n is even. 

Now suppose that n is odd. Then f'{c — e) < < /'(c + e) and so the first nonzero 
term in the sequence (12. 3p has sign — , +, + as t = c — e, c, c + e. If /(c) is positive, then 
var(5/, c — e) = var(5/', c — e) + 1 and the other two variations are unchanged, but if /(c) 
is negative, then the variation at t = c — e is unchanged, but it increases by 1 at t = c and 
t = c + e. This again implies the claim, as var(5/, c) = vax{6f, c + e), but 

var(5/, c — e) — var((5/, c) = var((5/', c — e) — var(5/', c) ± 1 . 

Since the difference var(5/', c — e) — var(5/', c) is equal to the order n of the vanishing of /' 
at c plus a nonnegative even number, if we add or subtract 1, the difference is a nonnegative 
even number. This completes the proof of the Budan-Fourier Theorem. '^X^ 

2.2 Sturm's Theorem 

Let /, g be univariate polynomials. Their Sylvester sequence is the sequence of polynomials 

/o '■= fj fl '■= 9j f2i ■ ■ ■ 1 fk ■, 

where fk is a greatest common divisor of / and and 

-fi+i ■= remainder(/i_i,/i) , 

the usual remainder from the Euclidean algorithm. Note the sign. We remark that we have 
polynomials qi,q2, ■ ■ ■ , Qk-i such that 

/,_i(t) = q^{t)m - f,+,{t), (2.4) 

and the degree of /j+i is less than the degree of /«. The Sturm sequence of a univariate 
polynomial / is the Sylvester sequence of /, /'. 
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Theorem 2.4 (Sturm's Theorem) Let f be a univariate polynomial and a,b & MU{±oo} 
with a < b and f{a),f{b) ^ 0. Then the number of zeroes of f in the interval {a,b) is the 
difference 

var(F, a) — vai{F, b) , 
where F is the Sturm sequence of f . 



Example 2.5 The sextic / of Example 12.31 has Sturm sequence 



30t^ - 20t^ - 108t^ + not 
ft^ + ft'^'-ft'-ft + 6 

559584 y.3 , 143748 ^.2 _ 605394 y. _ 126792 
36125 ~'~ 1445 7225 7225 

229905821875 y.2 , 1540527685625 f , 7904908625 
724847808 ~'~ 4349086848 ~'~ 120807968 
280364022223059296 f , 174201756039315072 
58526435357253125 292632176786265625 
17007035533771824564661037625 
162663080627869030112013128 ' 



Evaluating the Sturm sequence at t = gives 



126792 174201756039315072 17007035533771824564661037625 
7225 ' 292632176786265625 ' 162663080627869030112013128 ' 



which has 4 variations in sign, while evaluating the Sturm sequence at t = 2 gives 



1114 3210228 1076053821625 2629438466191277888 17007035533771824564661037625 
' 45 ' 36125 ' 2174543424 ' 292632176786265625 ' 162663080627869030112013128 ' 



which has 2 variations in sign. Thus by Sturm's Theorem, we see that / has 2 roots in the 
interval [0, 2], which we have already seen by other methods. 



/ = 

fi:=f\t) = 

/2 = 

fs = 

/4 = 

/5 = 

/6 = 



An application of Sturm's Theorem is to isolate real solutions to a univariate polynomial 
/ by finding intervals of a desired width that contain a unique root of /. When (a, b) = 
(— oo, oo), Sturm's Theorem gives the total number of real roots of a univariate polynomial. 
In this way, it leads to an algorithm to investigate the number of real roots of generic 
systems of polynomials. We briefly describe this algorithm here. This algorithm was used in 
an essential way to get information on real solutions which helped to formulate many results 
discussed in later chapters. 

Suppose that we have a system of real multivariate polynomials 

. . . = /2(xi, . . . = ■■■ = /jv(xi, . . . = 0, (2.5) 

whose number of real roots we wish to determine. Let / C . . . , x„] be the ideal 

generated by the polynomials /i, /2, . . . , /at- If (12.51) has finitely many complex zeroes, then 
the dimension of the quotient ring (the degree of /) is finite, and for each variable Xi, there 
is a univariate polynomial g{xi) G / of minimal degree, called an eliminant for /. 

Proposition 2.6 The roots of g{xi) = form the set ofith coordinates of solutions to (12. 5p . 
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The main tool here is a consequence of the Shape Lemma |4]. 

Theorem 2.7 (Shape Lemma) Suppose that I has an eliminant g{xi) whose degree is 
equal to the degree of I. Then the number of real solutions to (12. 5 p is equal to the number 
of real roots of g . 

Suppose that the coefficients of the polynomials fi in the system (12 .Sp lie in a computable 
subfield of M, for example, Q (e.g. if the coefficients are integers). Then the degree of / may 
be computed using Grobner bases, and we may also use Grobner bases to compute an 
eliminant g{xi). Since Buchberger's algorithm does not enlarge the field of the coefficients, 
g{xi) G Q[xi] has rational coefficients, and so we may use Sturm sequences to compute the 
number of its real roots. 

Algorithm 

Given: / = (/i, . . . , /^) c Q[xi, ...,Xn] 

1. Use Grobner bases to compute the degree d oi I . 

2. Use Grobner bases to compute an eliminant g{x.i) G / fl Q[xj] for I. 

3. If deg{g) = d, then use Sturm sequences to compute the number r of real roots of 
g{xi), and output "The ideal I has r real solutions." 

4. Otherwise output "The ideal I does not satisfy the hypotheses of the Shape Lemma." 

If this algorithm halts with a failure (step 4), it may be called again to compute an elim- 
inant for a different variable. Another strategy is to apply a random linear transformation 
before eliminating. An even more sophisticated form of elimination is RouUier's rational 
univariate representation [93] . 

2.2.1 Traditional Proof of Sturm's Theorem 

Let f{t) be a real univariate polynomial with Sturm sequence F. We prove Sturm's Theorem 
by looking at the variation var(F, t) as t increases from a to b. This variation can only change 
when t passes a number c where some member fi of the Sturm sequence has a root, for then 
the sign of fi could change. We will show that if i > 0, then this has no effect on the 
variation of the sequence, but when c is a root of / = /o, then the variation decreases by 
exactly 1 as t passes c. Since multiplying a sequence by a nonzero number does not change 
its variation, we will at times make an assumption on the sign of some value fj (c) to reduce 
the number of cases to examine. 

Observe ffist that by (12. 4p . if /j(c) = /i+i(c) = 0, then /,_! also vanishes at c, as do the 
other polynomials fj. In particular /(c) = /'(c) = 0, so / has a multiple root at c. Suppose 
first that this does not happen, either that /(c) 7^ or that c is a simple root of /. 

Suppose that fi{c) = for some i > 0. The vanishing of fi at c, together with ( 12. 4p 
implies that /i-i(c) and /^(c) have opposite signs. Then, whatever the sign of fi(t) for t near 
c, there is exactly one variation in sign coming from the subsequence fi_i{t),fi{t),fi+i{t), 
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and so the vanishing of fi at c has no effect on the variation as t passes c. Note that this 
argument works equally well for any Sylvester sequence. 

Now we consider the effect on the variation when c is a simple root of /. In this case 
/'(c) 7^ 0, so we may assume that /'(c) > 0. But then f(t) is negative for t to the left of c 
and positive for t to the right of c. In particular, the variation var(F, t) decreases by exactly 
1 when t passes a simple root of / and does not change when / does not vanish. 

We are left with the case when c is a multiple root of /. Suppose that its multiplicity is 
m + 1. Then (t — c)™ divides every polynomial in the Sturm sequence of /. Consider the 
sequence of quotients, 

G = {go,..., 9k) := u/it-cr, f/{t-cr, f2/{t-cr, uKt-cD. 

Note that var(G',t) = var(F, t) when t 7^ c, as multiplying a sequence by a nonzero number 
does not change its variation. Observe also that G is a Sylvester sequence. Since gi{c) 7^ 0, 
not all polynomials Qi vanish at c. But we showed in this case that there is no contribution 
to a change in the variation by any polynomial gi with i > 0. 

It remains to examine the contribution of g^ to the variation as t passes c. If we write 
fit) = {t- c)'^+^h{t) with h{c) ^ 0, then 

f{t) = (m + l)(t-c)'"/i(t) + (t - c)'"+^/i'(t) . 

In particular, 

^o(t) = {t-c)h{t) and ^i(t) = {m + l)h{t) + {t-c)h'{t). 

If we assume that h{c) > 0, then gi{c) > and go{t) changes from negative to positive as t 
passes c. Once again we see that the variation var(F, t) decreases by 1 when t passes a root 
of /. This completes the proof of Sturm's Theorem. 

2.3 A topological proof of Sturm's Theorem 

We present a second, very elementary, proof of Sturm's Theorem due to Burda and Khovan- 
skii [63J whose virtue is in its tight connection to topology. We first recall the definition of 
topological degree of a continuous function (p: MP^ — )■ MP^. Since MP^ is isomorphic to the 
quotient R/Z, we may pull (p back to the interval [0, 1] to obtain a map [0, 1] — )■ MP^. This 
map lifts to the universal cover of MP^ to obtain a map ip: [0, 1] — >■ M. Then the mapping 
degree, mdcg((p), of (f is simply ip{l) — "ipiO), which is an integer. We call this mapping 
degree to distinguish it from the usual algebraic degree of a polynomial or rational function. 

The key ingredient in this proof is a formula to compute the mapping degree of a rational 
function ip: MP^ — t- MP^. Any rational function ip = f/g where f,g & M[t] are polynomials 
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has a continued fraction expansion of the form 



= go + ^ (2.6) 

Qi + 

q2 H 



1 

+ — 

where qo, ■ ■ ■ ,qk are polynomials. Indeed, this continued fraction is constructed recursively. 
If we divide f hj g with remainder h, so that / = qog + h with the degree of h less than the 
degree of g, then 

h 1 

V = qo + - = go H . 

9 9_ 
h 

We may again divide g hj h with remainder, g = qih + k and obtain 

1 

= goH 



1 

qi + 



h 
k 

As the degrees of the numerator and denominator drop with each step, this process termi- 
nates with an expansion (12 .Op of if. 

For example, if / = Af^ - ISt^ - Qt and g = At^ + St^ - 1, then 

/ 1 

- = t-2 + 

9 



-2t + l + 



-2t 



t 



This continued fraction expansion is just the Euclidean algorithm in disguise. 
Suppose that g = cq + cit + ■ ■ ■ + Cdf^ is a real polynomial of degree d. Define 

[q] := sign(Q)-(d mod 2) G {±1,0}. 

Theorem 2.8 Suppose thatip is a rational function with continued fraction expansion fl2.6p . 
Then the mapping degree of ip is 

We may use this to count the roots of a real polynomial / by the following lemma. 

Lemma 2.9 The number of roots of a polynomial f , counted without multiplicity is the 
mapping degree of the rational function f / f ■ 
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We deduce Sturm's Theorem from Lemma [2^ Let /o, /i, /2 • • • , A- be the Sturm sequence 
for /. Then /o = f, fi = f, and for i > 1, -fi+i := remainder ( _ i, /j). That is, 
deg(/j) < deg(/j_i) and there are univariate polynomials (71, (72, ■ ■ ■ ,gk with 

fi-i = 9ifi - fi+i for z = 1, 2, . . . , k-1 . 

We relate these polynomials to those obtained from the Euclidean algorithm applied to /, /' 
and thus to the continued fraction expansion of ///'. It is clear that the fi differ only by a 
sign from the remainders in the Euclidean algorithm. Set Tq := / and ri = /', and for i > 1, 
Ti := remainder(rj_2, rj_i). Then deg(rj) < deg(rj_i), and there are univariate polynomials 
gi,g2, with 

ri_i = qiVi + ri+i for i = 1, . . . , /c-1 . 
We leave the proof of the following lemma as an exercise for the reader. 

Lemma 2.10 We have gi = {—iy~^qi and fi = (— l)l-tJrj, for i = 1,2, . . . , k. 

Write F for the Sturm sequence (/o, /i, /2 • • • , /fc) for /. Then var(F, 00) is the variation 
in the leading coefficients (/o°'', fl°^, ■ ■ ■ , fk"^) of the polynomials in F. Similarly, var(F, —00) 
is the variation in the sequence 

Note that the variation in a sequence (cq, Ci, . . . , c^) is just the sum of the variations in each 
subsequence (cj_i, q) for i = 1, . . . , fc. Thus 

var(F, —00) — var(F, 00) 

= ^ (var((-l)<^^^(^'-)/n, - var(/n,/r)) . (2.7) 

Since = gifi - fi+i and deg(/i+i) < deg(/j) < deg(/i_i), we have 

= ^r^/f^ and deg(/i_i) = deg(^i) + deg(/i) . 

Thus we have 

var(/^°P, = var(^*°P,l), and 
var((-l)<^'=^(^>-)/n,(-l)'i<=s(/Ojtop^ ^ var((-l)^^^(^»)<7rP,l). 

Thus the summands in fl2.7p are 

^ar((-l)des(^'0^fP,l) - var((7rM) = sign (^7r)(deg(^7,) mod 2) 

= [9^] = i-iy-%], 

This proves that 

var(F, -00) - var(F, 00) = [gi] + [^2] H h [gk] 

= [qi] - [q2] + ■ ■ ■ + i-ir'[qk] . 
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But this proves Sturm's Theorem, as this is the number of roots of /, by Lemma [2.91 



The key to the proof of Lemma 12.91 is an alternative formula for the mapping degree 
of a continuous function ip\ MF^ — )■ MP-*^. Suppose that p G MP-*^ is a point with finitely 
many inverse images (p~^{p). To each inverse image we associate an index that records the 
behavior of ip{t) as t increases past the inverse image. The index is +1 if ip{t) increases 
past p, it is —1 if ip{t) decreases past p, and it is if stays on the same side of p. (Here, 
increase/ decrease are taken with respect to the orientation of RP^.) For example, here is a 
graph of a function Lp in relation to the value p with the indices of inverse images indicated. 




With this definition, the mapping degree of ip is the sum of the indices of the points in a 
fiber ip~^{p), whenever the fiber is finite. That is, 

mdeg(v?) = index of a . 

ae</p-i(p) 

Proof of Lemma \2. 9{ The zeroes of the rational function ip := ///' coincide with the zeroes 
of /. Suppose /(a) = so that a lies in y9^^(0). The lemma will follow once we show that 
a has index +1. Then we may write f{t) = (t — aYh{t), where h is a polynomial with 
h{a) ^ 0. We see that f'{t) = d{t - aY'^hit) + (t - aYh'{t), and so 

_ fit) {t-a)h{t) t-a 



fit) dh{t) + {t- a)h'{t) d ' 

the last approximation being valid for t near a as h{t) ^ 0. Since d is positive, we see that 
the index of the point a in the fiber y?-^(0) is +L 

Proof of Theorem \2.^ Suppose first that ip and are rational functions with no common 
poles. Then 

mdeg(v9 + ip) = mdeg(v9) + mdeg('i/') . 

To see this, note that (y? + ^/')~^(oo) is just the union of the sets (p~^{oo) and ^/'"^(oo), and 
the index of a pole of ip equals the index of the same pole of {ip + ip). 

Next, observe that mdeg(<y9) = — mdeg(l/v9). For this, consider the behavior of ip and 
!/(/? near the level set 1. If <y9 > 1 than l/ip < 1 and vice- versa. The two functions 
have index at the same points, and opposite index at the remaining points in the fiber 
^-\1) = {1/V)-\1). 
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Now consider the mapping degree of (p = f/g as we construct its continued fraction 
expansion. At the first step / = fog + h, so that (p = fo + h/g. Since /o is a polynomial, its 
only pole is at oo, but as the degree of h is less than the degree of g, h/g does not have a 
pole at oo. Thus the mapping degree of ip is 

mdeg(/o + h/g) = mdeg(/o) + mdeg(^) = mdeg(/o) - mdeg(f ) . 

The theorem follows by induction, as mdeg(/o) = [fo]. '^X^ 

We close this chapter with an application of this method. Suppose that we are given 
two polynomials / and g, and we wish to count the zeroes a of f where g{a) > 0. If 
g = [x — b){x — c) with b < c, then this will count the zeroes of / in the interval [b,c\, 
which we may do with either of the main results of this chapter. The question is much more 
general, and it is not a priori clear how to use the methods in the first two sections of this 
chapter to solve this problem. 

A first step toward solving this problem is to compute the mapping degree of the rational 
function 

/ 

We consider the indices of its zeroes. First, the zeroes of (p are those zeroes of / that are 
not zeroes of g, together with a zero at infinity if deg(5') > 1. If /(a) = but g{a) ^ 0, then 
f = {t — aYh{t) with h{a) ^ 0. For t near a, 

pit) 



d- g{a) 



and so the preimage a E ip ^(0) has index sign ((7(a)). If deg{g) = e > 1 and deg(/) = d 
then the asymptotic expansion of p> for t near infinity is 

ip{t) 



e-l 



dget' 

where g^ is the leading coefficient of g. Thus the index of 00 G p>~^{0) is sign (5fe)(e— 1 
mod 2) = [g']. We summarize this discussion. 



Lemma 2.11 If deg{g) > 1, then 

sign{g{a)) = mdeg(v9) - [g] 



{a|/(a)=0} 



and if deg{g) = 1, the correction term —[g'] is omitted. 

Since mdeg{p) = — mdeg(l/v5), we have the alternative expression for this sum. 

Lemma 2.12 Let qi,q2, . . . ,qk be the successive quotients in the Euclidean algorithm applied 
to the division of f g by f . Then 

sign ((7(a)) = fe] - N + ■ ■ ■ + (-l)lgd • 

{a|/(a)=0} 
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Proof. We have 

/ f'9 
mdeg— = -mdeg— = -[qi] + [^2] h {-l)''[qk] , 

by Theorem 12. 8[ Note that we have f'g = qif + ri. If we suppose that deg(/) = d and 
deg{g) = e, then deg(gi) = e — 1. Also, the leading term of q is dg^, where g^ is the leading 
term of g, which shows that [gi] = [g']. Thus the lemma follows from Lemma 12.1 when 
deg{g) > 2. 

But it also follows when deg{g) < 2 as [qi] = in that case. '^X^ 
Now we may solve our problem. For simplicity, suppose that degg > 1. Note that 



i(sign {g'^{a)) + sign{g{a)) 

And thus 



1 if g{a) > 
otherwise 



#{a I /(a) = 0, g{a) > 0} = ^mdeg (^^1 + ^mdeg (^jj^ 



which solves the problem. 
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Chapter 3: 

Sparse Polynomial Systems 



Consider a system of n polynomials in n variables 

. . . = /2(xi, . . . = ■ 



fnij^li ■ ■ ■ 1 ■^n) 







(3.1) 



where the polynomial /j has total degree di. By Bezout's Theorem [lOj, this system has at 
most did2 - ■ ■ dn isolated complex solutions, and exactly that number if the polynomials are 
generic among all polynomials with the given degrees. 

Polynomials in nature (e.g. from applications) are not necessarily generic — they often 
have some additional structure which we would like our count of solutions to reflect. 

Example 3.1 Consider the system of two polynomials in the variables x and y 



x^y — xy^ — xy + 2 = . 



3y + 3x + 1 



and 



/ := x'^y + 2xy'^ + xy — 1 = and g : = 

Since these equations have the algebraic consequences, 

f ■{y-x + l)+g-{x + 2y + l] 
f ■ {3y' - 3xy^ + 5y^ - 2xy + 2y - 3) 

+ g ■ {6y^ + 3xy^ + 7y^ + 2xy + 2y + 3) = 9y^ + 9y^ + 2y + 9, 

we see that the system has three solutions. (The linear consequence allows us to recover / 
and g from the cubic in y.) 

Both polynomials / and g have degree three, but they only have three common solutions, 
which is fewer than the nine predicted by Bezout's Theorem. The key idea behind this deficit 
of 6 = 9 — 3 is illustrated by plotting the exponent vectors of monomials which occur in the 
polynomials / and g. 



3'. 



x^y 




(2,1) 




xy'^ 




(1,2) 


2 


xy 




(1,1) 


1 


1 


-H- 


(0,0) 




(3.2) 



The Newton polytope of / (and of g) is the convex hull of these exponent vectors. This 
triangle has area | and is covered by three lattice triangles. We will see why this number of 
lattice triangles equals the number of solutions to the original system. 
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3.1 Kushnirenko's Theorem 



The polynomial system in Example 13. II is a sparse system whose support is the set of integer 
points in the triangle of f l3.2p . Let ^ C Z" be a finite set of exponent vectors which affinely 
spans M". A sparse polynomial f with support ^ is a linear combination 

of monomials from A. Let A_4 C M" be the convex hull of the vectors in A. Let T := : = 
C — {0} be the algebraic torus. We recall Kushnirenko's Theorem from Chapter 1. 

Kushnirenko's Theorem. A system (13. ip of n polynomials in n variables with common 
support A has at most n!vol(A^) isolated solutions in T", and exactly this number if the 
polynomials are generic given their support A. 

We will give two proofs of this result, one using algebraic geometry that is due to Kho- 
vanskii and another which is algorithmic. Each proof introduces some important geometry 
related to sparse systems of polynomials. 

It is worth remarking that while sparse polynomials occur naturally — multilinear or 
multihomogeneous polynomials are an example — they also occur in problem formulations 
due to human psychology. Most of us are unable to write down or reason with polynomials 
having thousands of terms, and we instead seek problem formulations with fewer terms. 

3.1.1 The geometry of sparse polynomial systems 

Consider the map 

where P^^ is the projective space with homogeneous coordinates [zq | a G A\ indexed by A. 
This map factors 

T" — > — » T^/(5(T) c P-^, 

where T"^ = (C^)'"^' is the torus with coordinates indexed by A and 5(T) C T"^ is the 
diagonal torus. The quotient T-^/5(T) is the dense torus in the projective space P'^. It 
consists of those points [z^ | o £ -4.] G P"^ with no coordinate zero. Notice that Lp^ is a 
homomorphism into this dense torus. It is often convenient to identify A with the n x |^| 
matrix whose columns are the exponent vectors in A. 

Example 3.2 Suppose that A consists of the seven exponent vectors (0,0), (1,0), (0,1), 
(1, 1), (—1,0), (0, —1), and (—1, —1). Here is the corresponding matrix, 

/ 1 1 -1 -1 \ 
\^ 1 1 -1 -1 ■ 
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The convex hull A_4 of these points is the hexagon, 




and the map Lpjs, is 

: {x,y)eT^ ^ [l,x,y,xy,x-\y-\x''y-']e¥-^ = 4T> 
Consider a linear form A on P-^, 

A = CgZa . 

Its pullback V5^(A) along ip^^ is a polynomial with support A, 

a&A 

This provides a bijective correspondence between linear forms on P-^ and sparse poly- 
nomials with support A. Under (yj^, the zero set of a sparse polynomial is mapped to a 
hyperplane section of (^^(T") (the hyperplane is where the corresponding linear form van- 
ishes). Since n general linear forms on P-^ cut out a linear subspace L of codimension ra, 
a general system fl3.ip of sparse polynomials with support A is the pullback along of a 
codimension n linear section of <^^(T"). That is, it equals Lp'^{L) or f/)^"^ (</}(¥"■) n L). 

The closure of the image of is the toric variety^ X_4 parameterized by the monomials 
A. Since ^p_A is a homomorphism, the number d_A of solutions to a general sparse system 
with support A is the product 

dA = I ker(v?^)| ■ deg(X^) . (3.3) 

Indeed, if L is a general linear subspace of P-^ of codimension n, then Bertini's theorem 
implies that y9_4(T"') flL = X^^nL and this intersection is transverse^. When the intersection 
is not transverse, d_A will be the sum of the multiplicities of the solutions. The number of 
points in such a linear section is the degree deg(X_4) of X_4, and each point pulls back under 
^Pjs, to I ker((y9_4)| solutions of the sparse system corresponding to the linear section. 

^^There are competing notions of toric variety. We follow the convention from symbolic computation |122) . 
rather than from algebraic geometry |40] . In particular, we do not assume that Xj( is normal. 

^This transvcrsality will also follow from the arguments given in our second proof of Kushnirenko's 
Theorem. 
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3.1.2 Algebraic-geometric proof of Kushnirenko's Theorem 

We prove Kushnirenko's Theorem by showing that 

n!-vol(A^) = |ker(v5^)| • deg(X^) = d_A . 

This proof is due to Khovanskii [64j The same idea of proof is used by Khovanskii and 
Kaveh [61], where they extend the theory of Newton polyhedra for functions on to 
convex bodies associated to functions on an arbitrary affine variety. 

We first determine the kernel of the map <y9^, which is the composition 

jn — ^ jA — ^ T-^/S(T) C P-^ 
X I — > {x"" \ a e A) . 

To facihtate this computation, we assume that & A. This is no loss of generality, for if 
^ A, then we simply translate A so that one of its exponent vectors is the origin. This has 
the effect of multiplying each point in ip_A^(T"') by a scalar, and so it does not change X_4. It 
also multiplies each polynomial in (13. ip by a common monomial, which affects neither the 
solutions in T" nor their number (i_4. By the relation (13. 3p . this translation does not change 
the cardinality of the kernel of ifji^. 

Suppose that G ^. Then the Zo-coordinate of ipj^ is constant {x^ = 1) and so the map 
which sends x G T" to (x" | a e A) maps T" into 1 x C T-^, where \A\ = N + 1. The 
composition of the two maps 

1 X — ^ — y T^/(5(T) (c P"^) 

is an isomorphism. Thus it is sufficient to compute the kernel of the map 

: T" , 

X I — > {x"" \ a e A) , 

when e ^. 

Let ZA C Z"' be the sublattice spanned by the exponent vectors in A. It has full rank, 
by our assumption that A affinely spans M". Then the quotient Z"/Z^ is a finite abelian 
group whose order is the lattice index [Z" : ZA\. Its group Hom(Z"/Z^, T) of characters 
(homomorphisms to T) is also called its Pontryagin dual. We have the sequence of abelian 
groups, 

— > ZA — > Z" %"I%A — y 0. 
This gives rise to the sequence of Pontryagin duals, 

— > Hom(Z"/Z^,T) — > Hom(Z",T) — > Hom(Z>l, T) , 

which is exact in that the image of Hom(Z"/Z^, T) in Hom(Z", T) is the kernel of the last 
map. If we identify Hom(Z"', T) with T", then this sequence of groups, or direct calculation, 
shows that 

Hom(Z"/Z^,T) = {x e I x'^ = 1 Va e ^} = ker(^^) . 
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Thus ker((y9^) is the Pontryagin dual to the quotient JJ^ j'LA. Since Z"/Z^ is a finite abehan 
group, we see that 

|ker(y?^)| = [Z":Z^] = |Z"/Z^| . (3.4) 

The Hubert polynomial hx{d) a projective variety X is the polynomial which is even- 
tually equal to the dimension of the dth graded piece of the homogeneous coordinate ring 
C[X] of X, 

hx{d) = dimcCd[X], for all c? sufficiently large. 

The Hilbert polynomial contains many numerical invariants of X. For example, the degree 
of the Hilbert polynomial is the dimension n of X and its leading coefficient is deg(X)/'n.!. 
For a discussion of Hilbert polynomials, see Section 9.3 of [22]. 

We determine the Hilbert polynomial of the toric variety X_4. For this, it is helpful to 
consider the homogeneous version of the parametrization map (pj^. We lift .4 C Z" to a 
homogenized set of exponent vectors C 1 x Z" by prepending a component of 1 to each 
vector in A. That is, 

^+ := {(l,a) \ aeA}. 

The matrix A'^ is obtained from the matrix A by adding a new first row of Is. For the 
hexagon of Example 13. 2[ this is 

/ 1 1 1 1 1 1 1 \ 
^+=0110-1-1 . 
\ 1 1 -1 -1 / 

Here is the lifted hexagon, where the first coordinate is vertical. 




The map </9_4+ on has the same image in P-^ as does ifj^. The advantage is that the 
image of the map ip_A+ in C"^ is stable under multiplication by scalars — this is built into 
it via the new first coordinate of A'^. If we let t be the first coordinate of f^'^", then the 
puUback of the coordinate ring of P"^ to the ring of Laurent polynomials (the coordinate 
ring of T"^"*"") is 

Sa := Cltx" \aeA] - C[N^+] . 

This is also the homogeneous coordinate ring of the toric variety X^ = V^^(T^"''") (the closure 
taken in P-^). 

The grading on is given by the exponent of the variable t. It follows that the dth 
graded piece of has a basis of monomials 

{t'^-x'' \ae dA} , 
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where dA is the set of (i-fold sums of vectors in A. This is just the set of integer points in 
(iA^ which he in the nonnegative integer span of A. If we let Hys,{d) be the dimension 
of the dth graded piece of the homogeneous coordinate ring of (also called the Hilbert 
function of X^), then these arguments show that 

= |c/A^nN^|. 

We will estimate this Hilbert function, which will enable us to determine the leading coeffi- 
cient of the Hilbert polynomial, as the Hilbert function and Hilbert polynomial agree for d 
sufficiently large. 

Let A ~ Z" be a lattice in M" and A be a polytope with vertices in A. Ehrhart [25] 
showed that the counting function 

Pa : ^3 d I — > \dAr\A\ 

for the points of A contained in positive integer multiples of the polytope A is a polynomial 
in d. This polynomial is called the Ehrhart polynomial of the polytope A, and its degree is 
the dimension of the affine span of A. When A has dimension n, its leading coefficient is 
the volume of A, normalized so that a fundamental domain of the lattice A has volume 1. 
That is, it is the Euclidean volume divided by the lattice index [Z" : ZA]. When A = Z™, 
this is the ordinary Euclidean volume of A. 

Now suppose that A = A_4, the convex hull of A. Since dA C c?A^ fl IjA, if A = ZA, 
we have 

Pa Ad) > H^d). (3.5) 
We give a lower bound for H^{d). Let B be the points b in T^A which may be written as 

b = ^CaO, 

where Ca is a rational number in [0, 1). Fix an expression for each 6 G i3 as an integer linear 
combination of elements of A, and let —N with > be an integer lower bound for the 
coefficients in these expressions for the finitely many elements of B. 

For d > N\A\ we claim that translation by the vector N'^^^j^a defines a map 

Z^n (rf- A^|^|)A^ — y NAndA^. 

Indeed, a point v G ZAn^d — N\A\)Aji^ is a nonnegative rational combination of the vectors 
in A. Taking fractional parts gives v = b + c, where b E B and c G NA. Adding ^aeA ^ 
to the fixed integral expression of b gives a positive integral expression, which proves the 
claim. This shows that 

HA{d) > PAAd-N\A\). 

If we combine this estimate with (13. 5p . and use the result that the Hilbert function equals 
the Hilbert polynomial for d large enough, then we have shown that the Hilbert polynomial 
/i^ of X_4 has the same degree and leading coefficient as the Ehrhart polynomial Paa- 
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Thus the Hilbert polynomial has degree n and its leading coefficient is vol(A^)/[Z" : Z^], 
which is the normalized volume of the polytope A_4 with respect to the lattice IjA. We 
conclude that the degree of Xj^ is 

vol(A^) 



[Z" : ZA] ■ 

Recall (13 .4^ that the kernel of ifjs, has order [Z": 'LA\. Then the formula fl3.3p for the 
number d^^ of solutions to a sparse system (13. ip with support A becomes 

dj, = |ker(y,^)|-deg(X^) = [Z^ : ZA] ■ n\ = n!vol(A^), 

which proves Kushnirenko's Theorem. 



3.2 Algorithmic proof of Kushnirenko's Theorem 

We present a second proof of Kushnirenko's Theorem whose advantage is that it introduces 
more geometry connected to toric varieties (which will be useful in subsequent chapters), 
in particular, the proof uses toric degenerations (sometimes called Grobner degenerations). 
These toric degenerations underlie the method of Viro [130[ 1121] . which is important for 
many constructions in real algebraic geometry. We first analyze the simplest case of Kush- 
nirenko's Theorem — when |^| = n + 1. 

3.2.1 Kushnirenko's Theorem for a simplex 

Suppose that |^| = n + 1 so that A_4 is a simplex with vertices A. Then X_4 = = P-^. 
As we saw, the solutions to any sparse system (13. ip with support A have the form 93^^ (L), 
where L C P" is the codimension-n plane cut out by the linear forms which define the 
polynomials of the system. In this case L is simply a point /3 G P" (which lies in the dense 
torus as the equations are general), and the solutions have the form (p^^{f3). Since (/9_4 is a 
homomorphism to the dense torus of P"^, these solutions form a single coset of ker^ipX). 

We may determine these solutions explicitly. Assume that E A. Since |^| = n + 1, we 
may write the sparse system (13. ip as 

C- (x'^i,x"%...,x"")'^ = 6, (3.6) 

where A — {0} = {oi, 02, ... , a„}, C is the nhj n matrix of coefficients, and b G C". If our 
system is generic, then C is invertible and we may perform row operations on C and hence 
on the system (13. 6p to obtain an equivalent binomial system 

x"^ = I3i for z = l,...,n. (K^ ' 

where . . . G T, and so the system has the form Lp^^((3). (In fact, the requirements 
that C be invertible and that the resulting constants /3j G T are the conditions for genericity 
of this system.) 

Let A be the n by n matrix whose columns are the exponent vectors in ^ — {0}. We 
will see how the integer linear algebra of the matrix A is used to solve the system ([3]6])'. 
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Example 3.3 Consider the system of equations with support A = {(16, 14), (22, 18), (0, 0)} 

23x^^y^^ - = -27, and 

35x''y'^ - x^Y' = -9. ^^'^^ 
Subtracting the two equations gives the binomial, 

12x^%^^ = 36 or x^%^^ = 3, 
and solving for x^'^y^^ gives x'^'^y^^ = 96. Thus we have the equivalent binomial system 

x^S^" = 3 and = 96. (3.8) 

Under the invertible substitution (the inverse is given by m = x^y"^ and v = {xy)~^) 

X = uv^ and y = u'-^ir^ , (3.9) 
our equations become triangular 

{uvy{u-'v-'y^ = = ^2 _ 3^ 

(mV^)22(m-1|;-8)18 = u22t;154^-18^-144 ^ ^4^10 ^ gg _ (-^-^Uj 

While the solution is now immediate via back substitution, we make one further simplifying 
substitution. Write / for the first equation and g for the second. Replacing g by gf~^ yields 
a diagonal system which is now completely trivial to solve 

f : =3, 
gf-^ : i;io = 32/3. 

The solutions are u = ±y/3 and v = CV^, where ( runs over all 10th roots of |. Substituting 
these into (13. 9p . gives the 20 solutions to our original system of equations (13. 7p . 



Underlying these simplifications is the relation between the integer linear algebra of n 
by n matrices and (multiplicative) coordinate changes in T". Since T" = Hom(Z",T), its 
automorphism group is GL{n,Z), the group of invertible n hj n integer matrices, and this 
is the source of that relation. 

Example 3.4 The monomials in (13. 8p correspond to the columns of the matrix 

A - (^^ 22 
^ ~ 1^14 18 

and the coordinate change ( 13. 9 p corresponds to left multiplication (hence row operations) 
by the matrix 
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Indeed, 

A -1\ /16 22\ _ /2 4\ 
\7 -s) \U 18y) ~ 1^0 lOyl ' 

which corresponds to the exponent vectors in the triangular system f l3.10p . This upper 
triangular matrix is the Hermite normal form of the matrix A — the row reduced echelon 
form over Z. This notion makes sense for matrices whose entries lie in any principal ideal 
domain. 

Multiplicative reductions using the equations correspond to multiplicative coordinate 
changes in the target torus and are represented by column operations, or right multipli- 
cation by integer matrices. Indeed 

^0 10^^0 1 J \0 lOj ' 
which is the Smith normal form of the integer matrix A. 

The Smith normal form of an by n matrix A is the diagonal matrix with entries 
di,d2, ■ ■ ■ ,dn where di is the greatest common divisor of all i hj i subdeterminants (minors) 
of A. These are called the invariant factors of A, and we have di\d2\ - ■ ■ \dn- Then 

Z''/ZA ~ Z/diZ X Z/d2Z x ■■■ x Z/d^Z, 

the canonical way to write a finite abelian group. As with the Hermite normal form, the 
Smith normal form makes sense for presentations of modules over any principal ideal domain. 

3.2.2 Regular triangulations and toric degenerations 

Let w: ^ — 7- N. This is a weight for T acting diagonally on the space C-^, if z = {za \ a E 
A) G C^^, then t.z = {t'^^"'^Za \ a E A). This induces a similar action on P-^, and a dual 
action on the homogeneous coordinate ring C[za\aeA] of P-^, namely 

t.Za = t-'^^'^ha for a G ^, 

where Za is a variable (coordinate function on C'^). Let Xj^ C C x P-^ be the closure of the 
family over T of deformations of the toric variety X_4 under this action, 

:= {{t, t.z) cCxP-^ltGT, ze Xj^} . 

This is a flat family over C p9l Ch. 15]. (This technical fact implies that all fibers have the 
same Hilbert polynomial.) The fiber of over a point t G T is the translated toric variety 
t.X_A, and the fiber over G C is called the sclieme-tlieoretic limit of the family t.X_A, which 
is written 

limt.X/i . 

We use geometric combinatorics to study this limit. The passage from X_4 to such a scheme- 
theoretic limit of an action of T is a toric degeneration. 
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We may also use the weight w: ^ — )■ N to hft the vector configuration A into M^^". 
Consider the convex hull in Mi+" of the lifted vectors 

Pi^ := conv{(u;(a), a) | a E A} . (3-11) 

The lower facets of this polytope are those facets of P^j whose outward-pointing normal 
vector has a negative first coordinate. Projecting these lower facets back to M" gives the 
facets in the regular polyhedral subdivision A^^ of the convex hull of A induced by the lifting 
function u. The vertices in this subdivision are some of the vectors in A. While much of 
the following makes sense for general polyhedral subdivisions, we shall henceforth assume 
that is a triangulation in that each lower facet is a simplex. (This is a mild genericity 
assumption on u.) We display the lower facets and the resulting triangulation for a 
weight function cu on where ^ = {(0, 0), (1, 0), (0, 1), (2, 0), (1, 1), (0, 2)}. 




Exponent vectors a of monomials in the coordinate ring C [^^ | a G ^] of P-^ are elements 
of N"^, and u induces a linear form on N"^, 



u ■ a 



(oa \ a e A) = ''^u{a)aa ■ 



The initial form irii^^g) of a homogeneous form g{z) G C[za | a G ^] is the sum of the 
terms Cq,^" of g for which a; ■ a is maximal among all terms of g. Let uj{g) be this maximal 
value. Multiplying t.g by f^^^^ we see that 



t''^^\t.g{z)) = m^g{z) + h. 



where h is divisible by t. Thus 

\im t^^3\t.g{z)) = in^g{z). 

i— 5>0 

Write for the ideal defining X^. (This toric ideal has a linear basis of binomials z" — z^ 
such that A'^a = where A is the matrix whose columns are the exponent vectors in 

A'^ |122[ Lemma 4.1].) These binomials have the following geometric interpretation. The 
product A'^a is a positive linear combination of the vectors in A'^, so binomials z"' — z^ in J_4 
record vectors in NA that have two distinct representations as positive linear combinations 
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of the vectors in A'^. If we divide by the the initial coordinate of the vector A'^a, we obtain 
a vector of the form (1, x), where x is a rational point lying in the convex hull of A. When 
A~^a = A'^P, this pont x has two distinct rational representations as a convex combination 
of elements of A. Thus x lies in the convex hulls of two different subsets of A. Conversely, 
any such point gives rise to a binomial z"' — & Ij^^. 

When oj ■ a > u ■ P, then 2;" is the initial term in^(z" — z^). The initial ideal of /_4 is 

it^Ma) = {i'^M) \ 9 ^ Ia}- 
Since the ideal /(t.X^) of t.X_4 is 

lit.Xj,) = {t.g{y)\geI{Xj,)} = U^, 

we see that 

limt./^ = in^(/^) . 

t— >o 

Thus this initial ideal is the ideal of the scheme-theoretic limit of the family t.Xj^. 

3.2.3 Kushnirenko's Theorem via toric degenerations 

Since the family — C is flat, every fiber has the same degree, and so Kushnirenko's 
Theorem follows if we can prove that the degree of the limit scheme mi^{Xj) is n!vol(A^) 
divided by the degree [Z" : ZA] of ip^i^. It is proved in Chapter 8 of |122] that 

V^^Ma) = f]{za I a^r) , (3.12) 

T 

the intersection over all facet n-simplices r of the regular triangulation of A_4. This result is 
not so hard. If the segment ab for a, 6 G ^ is not a face of the triangulation A,^, then it crosses 
a minimal face conv(r) of the triangulation. This implies that there is a binomial z^'^ — z"' , 
where M, N are positive integers and the monomial z"' involves the variables in r. By the 
construction of the triangulation A^^, the corresponding lifted segment (ci;(a), a), {uj{h), h) of 

lies above the lift of the face conv(r), and thus the initial term of this binomial is z^z^ , 
and so ZaZf, lies in the radical \/iikJXAj of the initial ideal. 

It follows that the limit scheme in^{XX) is supported on the union of coordinate n-planes 
P^, one for each facet n-simplex r of the regular triangulation. (Here is the coordinate 
plane which is spanned by the coordinates indexed by r.) The degree of this initial scheme 
is then 

deg(in<^(X^)) = ^ multpr(in^(X^)) , 

T 

the sum over these facets r of the algebraic multiplicity of the limit scheme mi^{XjC) along 
the coordinate n-plane P"^. 

In |122[ Chapter 8], and under the (mild) assumption that A is primitive CZA = Z"), 
Sturmfels shows that this multiplicity is n!vol(AT-). Since these facets cover A^, the degree 
of the limit scheme in^(X^) is n!vol(A^). As the family is fiat, this degree is the degree 
of Xj\^, and we may deduce Kushnirenko's Theorem from this. 

Before we continue, we deduce a corollary from this. A triangulation of a polytope in 

is unimodular is every facet has minimal volume l/n\. 
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Corollary 3.5 Suppose that A^^ is a regular unimodular triangulation. Then the limit 
scheme of the corresponding flat toric degeneration is a union of coordinate n-planes, one 
for every facet r of A^^, 

limt.x^ = U P^. 

We give a different proof which is adapted from [105] and does not assume primitivity. 
Its limitation is that it only works over C as it uses metric properties of C and not an 
arbitrary algebraically closed field. 

The main idea is to fix a general linear subspace L of codimension n in P'^ and consider 
the family of linear sections L fl t.X_4 of fibers of the fiat family ^¥4, for t near 0. This is 
illustrated in Figure 13.11 The subspace L will meet each facet n-plane P"^ in a single point 



pr 




Figure 3.1: Points in L fl t.X_4 near P"^ for small t. 

Pr, and the points of L fl t.X_4 for t small will be clustered near the different points Pr- 

We could determine the number of points clustered near one of the Pr, which is the 
algebraic multiplicity, multpr(intj(X_4)). It is in fact easier to determine the number of 
points in T*^ of the form ip^^ (t~^ .x) , for x a point clustered near one of the Pr and when t 
is small. This is more direct, and it bypasses computing this algebraic multiplicity. This is 
also where we avoid the assumption of primitivity, but must work over C 

Note that in a neighborhood of P^, the linear space L is isotopic to tt^^Ipt), which is a 

fiber of the coordinate projection vr^ : P^^ > P"^. This is a rational map not defined on the 

linear span of coordinates {za \ a ^ t}. It follows that the number of points in T" coming 
from points in the linear section L fl t.X^ near Pr is equal to the number of points in 
coming from points in the linear section ir^-^lpr) fl t.X_A near p^-. This is simply the degree 
of the map which is the composition of the parametrization y9_4 of X_4, the map z ^ t.z on 
P-^, and this projection tTt-, 

T" Xj^ A t.Xj^ ^ P". 

Since multiplication by t is homotopic to the identity and it commutes with the projection 
TTr, we may assume now that t = 1, and so this composition is nothing other than the 
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parametrization of P"^ by the monomials corresponding to integer points of r. The degree 
of this map is the order of the kernel of which is n\ ■ vol(A^). Summing this quantity 
over all facets r of the triangulation shows that there are 

^n!-vol(A^) = n!vol(A^) 

r 

points in which are pullbacks under (y9_4 of the linear section L fl X_4. This completes our 
algorithmic proof of Kushnirenko's Theorem. '^X^ 

The reason that it is algorithmic is that it (more-or-less) counts the solutions to the 
system L fl t.Xjs,, for t small, while also giving enough information on their location and 
structure to determine them numerically so that they may become the input to the polyhe- 
dral homotopy method [52| I129l| for computing the solutions at t = 1. 

Note that this proof also shows that the intersection Xjs, fl L is transverse when L is 
general, and thus gives a proof of Bertini's Theorem in this context. Since, for t small, the 
intersection near p^- may be deformed to the intersection of with the horizontal subspaces 
7r~^(p^), and this is deformed to the system ip~^{j)T-), which consists of n voI(At-) distinct 
points, the general such intersection is transverse. 

3.2.4 A brief aside about real solutions 

Observe that if the triangulation A^^ is unimodular, in that each facet has minimal volume 
l/n\, then near each point there will be exactly one point of L fl t.X^ and one corre- 
sponding solution in T". If both L and t are real, then each p^ and each nearby point in 
t.Xj^ will be real. Since 

r^(Lnt.x^) = {t-\L)nx^, 

and the points in the left hand side are all real, so are the points in the right hand side. This 
right hand side corresponds to a system of real polynomials with support A. This proves a 
theorem of Sturmfels [120] . and gives his argument in a nutshell. 

Theorem 11.81 // a lattice polytope A C Z" admits a regular unimodular triangulation, then 
there exist real polynomial systems with support A fl Z" having all solutions real. 

A more careful analysis, which begins by examining real solutions when |^| = n + 1, 
leads to the more refined result for not necessarily unimodular triangulations that appears 
in Sturmfels's paper. 
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Chapter 4: 
Upper Bounds 



Recall Descartes rule of signs ([25] or Section r2.ip . which gives a bound for the number of 
positive solutions to a univariate polynomial. 

Theorem 12.11 (Descartes's rule of signs) The number, r, of positive roots of a univariate 
polynomial 



counted with multiplicity, is bounded above by the number of variations in sign of the coeffi- 
cients of f , 

#{z \ 1 < i < m and Ci-iCi < 0} < r , 

and the difference between the variation and r is even. 

In (14. ip we assume that < ai < ■ ■ ■ < am and no coefficient q vanishes. 
Thus a univariate polynomial with m + 1 monomials has at most m positive roots. This 
bound is sharp, as the polynomial 



has m + 1 distinct terms and m positive roots. Replacing x by x"^ gives a polynomial with 
m+1 terms and 2m nonzero real roots. 

This chapter and the next will discuss extensions of this Descartes bound to systems of 
multivariate polynomials. 

4.1 Khovanskii's fewnomial bound 

Descartes's rule of signs suggests that the number of real roots to a system of polynomials 
depends not on its degree, but rather on the complexity of its description. D. Bernstein 
and A. Kushnirenko formulated the principle that the topological complexity of a set in M."' 
defined by real polynomials is controlled by the complexity of the description of the poly- 
nomials, rather than by their degree or Newton polytopes. This is exactly what Khovanskii 
found in 1980 with his celebrated fewnomial bound. 

Theorem 4.1 (Khovanskii |65| ) A system of n real polynomials in n variables involving 
l+n+1 distinct monomials will have at most 








(4.3) 



nondegenerate positive solutions. 
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We remark that nondegenerate solutions are isolated, and there are finitely many of 
them. This bound, like other bounds in this part of the subject, considers solutions in the 
positive orthant M" . A consequence of Khovanskii's bound is that for each / and n, there is a 
number X{1, n) which is equal to the maximum number of positive solutions to a system of n 
polynomials in n variables having l+n+1 distinct monomials. A central question in this area 
is to determine the Khovanskii number X{l,n) exactly, or give good bounds. Khovanskii's 
Theorem shows that X{l,n) is bounded above by the quantity f l4.3p . 

A complete proof of Theorem 14.11 may be found in Khovanskii's book [BB] , where much 
else is also developed. Chapter 1 of that book contains an accessible sketch. Benedetti 
and Risler [5l §4.1] have a careful and self-contained exposition of Khovanskii's fewnomial 
bound. We give a sketch of the main ideas in the exposition of Benedetti and Risler, to 
which we refer for further details (this is also faithful to Khovanskii's sketch). We remark 
that Sturmfels has also sketched ( [124^ pp. 39-40] and in |123j ) a version of the proof. This 
omits some contributions to the root count and is therefore regrettably incorrect. 

Khovanskii looks for solutions in the positive orthant M", proving a far more general 
result involving solutions in M" of polynomial functions in logarithms of the coordinates and 
monomials. For this, he took logarithms of the coordinates. Set 

Zi := log(xi) and yj := e""'"-' = x"^ , (4.4) 

where i = 1, . . . ,n, j = 1, . . . , k, and Uj G M" can be real exponents. Consider a system of 
functions of the form 

Fi{zi,...,Zn, yi,...,yk) =0 i = l,...,n, (4.5) 

where each yj = yj{z) is an exponential function e^'°'^ and the Fi are polynomials in n+k 
indeterminates. 

Theorem 4.2 (Khovanskii's Theorem) The number of nondegenerate real solutions to 
the system (14.51) is at most 

(J[ deg F,) -(l + Yl deg i^.) ■ 2(^) . (4.6) 

i=l i=l 



Proof of Theorem \4.1\ Given a system of n real polynomials in n variables involving l+n+1 
distinct monomials, we may assume that one of the monomials is 1. Under the substitu- 
tion (14.41) . this becomes a system of the form (14.51) . where each Fi is a degree 1 polynomial 
in k = I + n variables. Then degF, = 1 and the bound (14.61) reduces to (14. 3p . 

Sketch of proof of Theorem \4.S\ We proceed by induction on k, skipping some technicalities 
involving Sard's Theorem. When k = 0, there are no exponentials, and the system is just a 
system of n polynomials in n variables, whose number of nondegenerate isolated solutions 
is bounded above by the Bezout number, 

n 

UdegF,, 
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which is the bound ( 14. 6p when k = 0. 

Suppose that we have the bound (14. 6 p for systems of the form (14. 5 p with k exponentials, 
and consider a system with k+1 exponentials and one added variable t. 



Gi{z,t) := Fi{zi,...,Zn, yi,...,yk,t-yk+i) 







I = 



1,... 



n 



(4.7) 



t = 1 



The subsystem (14. 7p defines an analytic curve C in M""^^, which we assume is smooth and 
transverse to the hyperplane at t = 1. 

Write Zn+i for t and consider the vector field ^ in M"+^ whose rth component is 



This vector field is tangent to the curve C, and we write = ^n+i for its component in the 
t-direction. An important ingredient in our proof of Theorem 14.21 is a special case of the 
Khovanskii-Rolle Theorem |66l pp. 42-51]. 

Theorem 4.3 (Khovanskii-Rolle Theorem) The number of points of C where t = 1 is 
bounded above by 



where N is the number of points of C where = and q is the number of unbounded 
components of C. 

Proof. Note that varies continuously along C. Suppose that a and b are consecutive 
points along an arc of C where t = 1. Since C is transverse to the hyperplane t = 1, we have 
^t(a) • ^t{b) < 0, and so there is a point c of C on the arc between a and b with ^t{c) = 0. 

The hyperplane t = 1 cuts a compact connected component of C into the same number 
of arcs as points where t = 1. Since the endpoints of each arc lie on the hyperplane t = 1, 
there is at least one point c on each arc with C,t{c) = 0. Similarly, the hyperplane t = 1 
cuts a noncompact component into arcs, and each arc with two endpoints in the hj^erplane 
t = 1 gives a point c with ^t(c) = 0. However, there will be one more point with t = 1 on 
this component than such arcs. 

We illustrate the argument in the proof below. 




(4.8) 



N + q, 




t = 



c 



1 
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The key to the induction in the proof of Khovanskii's formula is to replace the last 
exponential by a new variable. This substitution is omitted in Sturmfels's argument, which 
also does not use the Khovanskii-RoUe Theorem 14.31 Since we have 

dG, OF, , 

\zx, . . . ,2n, 2/1, • • ■,yk,'tyk+i) 



dZr dZr 



' OF, 



j=i yj 

+ ^ (2:1, ■■■,Zn, 2/1, • ■ .,yk,tyk+i) ■ ak+i,r tyk+i , 

dyk+i 

if we set u := tyk+i and define (p{z,u) to be the expression for = ^n+i (14. 8 p written in 
terms of z and u, then the total degree (in zi, . . . , Zn, 2/1, ... , yk, u) of (ptiz, u) is at most 
Er=idegF.. 

Thus number of Theorem 14.31 is the number of solutions to the system 
Fi{zi,...,Zn, yi,...,yk, u) = i = l,...,n 

(4.9) 

4>t{z,u) = 0. 

This has the form (14. 5 p with k exponentials. Given any solution to the system (14. 9p . we use 
the substitution u = ty^^i to solve for t and get a corresponding point c on the curve C with 
^t(c) = 0. We apply our induction hypothesis to the system (14. 9 p (which has k exponentials 
and n+1 equations in n+1 variables) to obtain 



n n n ^ 

N < JJdegF,- (j]degF,) ■ (l + 2^degF,) -2 

1=1 1=1 i=l 



We similarly estimate the number q of noncompact components of C. We claim that this 
is bounded above by the maximum number of points of intersection of C with a hyperplane. 
Indeed, since each noncompact component has two infinite branches, there are 2q points 
(counted with multiplicity) on the sphere S*" corresponding to directions of accumulation 
points of the branches of C at infinity. Any hyperplane through the origin not meeting these 
points will have at least q of these points in one of the hemispheres into which it divides 
the sphere. If we translate this hyperplane sufficiently far toward infinity, it will meet the 
branches giving these accumulation points, and thus will meet C in at least q points. 

Thus q is bounded by the number of solutions to a system of the form 

Fi{zi,...,Zn,yi,...,yk,u) = z = l,...,n 

(4.10) 

/o + hZi + I2Z2 H h InZn + luU = , 

where /o, • • • , ^n, are some real numbers. This again involves only k exponentials, and the 
last equation has degree 1, so we have 



q < JJdegFi-1- (l + ^degF, + l) -2(2) 
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Combining these estimates gives 

n n ^ k ^ k 

N + q < J]degF,-2© . [(^^degFi) ■ (l + 2^degF,) + (2 + ^degFi) . 

i=l i=l i=l i=l 

We can obtain a simpler (but larger) estimate by bounding + g by the number of 
solutions to the single system of equations, 

Fi{zi,...,Zn, yi,...,yk,u) = i = l,...,n 
Fn+i := (/o + hzi + I2Z2 H h InZn + Lu) ■ 4>t{z, u) = . 

By our induction hypothesis, we have 

n+l n+l ^ 

N + q < JJdegF, ■ (l + J]degF,) -2©. 

i=l i=l 

But we saw that has degree at most Y17=i^^S^i + so the number, M, of 

solutions to the system with A; + 1 exponentials is bounded by 

n n n ^ k 

M < N + q < JJdegF, ■ (^1 + J^degFi) ■ (^1 + ^degF, + l + ^degF,) -2(2) 
1=1 1=1 1=1 1=1 

n n 

= JJdegi^^- (l + 5^degF,) ■ 2('^') , 

i=l 1=1 

which completes the proof of Theorem 14.21 

We see that the result of Theorem 14.21 is much more general than the statement of 
Theorem 14. 1[ Also, the bound is not sharp. While no one believed that Khovanskii's 
bound f l4.3p was anywhere near the actual upper bound X{1, n), it was been extremely hard 
to improve it. We discuss the first steps in this direction. 



4.2 Kushnirenko's conjecture 

One of the first proposals of a more reasonable bound than Khovanskii's for the number of 
positive solutions to a system of polynomials was due to Kushnirenko, and for many years 
experts believed that this may indeed be the truth. 

Conjecture 4.4 (Kushnirenko) A system /i = /2 = ■■■ = /« = 0/ real polynomials 
where each fi has rrii+l terms has at most mim2 ■ ■ -rrin nondegenerate positive solutions. 

This generalizes the bound given by Descartes's rule of signs. It easy to use the exam- 
ple (14.21) for the sharpness of Descartes's rule to construct systems of the form 

/l(a;i) = f2{x2) = ■■■ = fn{Xn) = 0, 



43 



which achieve the bound of Conjecture 14.41 

Soon after Kushnirenko made this conjecture, K. Sevostyanov found a counterexam- 
ple which was unfortunately lost. Nevertheless, this conjecture passed into folklore until 
Haas [H] found a simple example of two trinomials (3 = 2 + 1) in variables x and y with 
5 (> 4 = 2 ■ 2) isolated nondegenerate positive solutions. 

10x^°^ + lly^^ - Uy = lOy^'^^ + llx^^ - llx = 0. (4.11) 

There have been other attempts to find better bounds than the Khovanskii bound. 
Sturmfels |120] used the toric degenerations introduced in Chapter [3] to show how to con- 
struct systems with many real roots (the root count depends upon a mixture the geometry 
of the Newton polytopes and some combinatorics of signs associated to lattice points)^. This 
inspired Itenberg and Roy |55] to propose a multivariate version of Descartes's rule of signs, 
which was later found to be too optimistic [75]. An interesting part of this story is told in 
the cheeky paper of Lagarias and Richardson [71] . 

More recently, Li, Rojas, and Wang looked closely at Haas's counterexample to Kush- 
nirenko's conjecture, seeking to obtain realistic bounds for the number of positive solutions 
which depended only on the number of monomials in the different polynomials. For example, 
they showed that Haas's counterexample was the best possible. 

Theorem 4.5 (Li, Rojas, and Wang |74| ) A system consisting of two trinomials in two 
variables has at most 5 nondegenerate positive solutions. 

Dickenstein, Rojas, Rusek, and Shih [25] used exact formulas for ^-discriminants [26] to 
study systems of two trinomials in two variables which achieve this bound of five positive 
solutions. They gave the following example, which indicates how difficult it is to find systems 
with many real solutions. 

Example 4.6 Consider the family of systems of bivariate sextics, 

+ ay^ - y = y^ + bx^ - X = 0, (4.12) 

where a, b are real numbers. When a = b = 78/55, this has five positive real solutions 

(0.8136,0.6065), (0.7888,0.6727), (0.7404,0.7404), (0.6727,0.7888), (0.6065,0.8136). 

We now investigate the set of parameters (a, b) for which this achieves the trinomial bound of 
five positive solutions. This turns out to be a single connected component in the complement 
of the discriminant for this family of systems. This discriminant is a polynomial of degree 
90 in a, b with 58 terms whose leading and trailing terms are 

1816274895843482705708030487016037960921088a^V^ + ■ ■ ■ 56 terms ■ ■ ■ + 

1102507499354148695951786433413508348166942596435546875. 

^Actually, he used the toric proof of Bernstein's Theorem. 
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We display this discriminant in the square [0,4] x [0,4], as well as three successive magnifi- 
cations, each by a factor of 11. The shaded region in the last picture is the set of pairs (a, b) 
for which (14.121) achieves the trinomial bound of five positive real solutions. 




To compare the trinomial bound in [74j to the fewnomial bound ( 14. 3p , note that we may 
multiply one of the polynomials by a monomial so that the two trinomials share a monomial. 
Then there are at most 5 = 2 + 2 + 1 distinct monomials occurring in the two trinomials. 
The fewnomial bound for I = n = 2 is 

X(2,2) < 2(2) ■ (2 + 1)^ = 5184. 

We remark that a trinomial system is not quite a general fewnomial system with I = n = 2. 
Still, the bound of five real solutions lent credence to the belief X(2, 2) is closer to five than 
to 5184 and that Khovanskii's fewnomial bound (14. 3 p could be improved. 

In addition to providing the counterexample to Kushnirenko's conjecture, Sevostyanov 
also established the first result of fewnomial-type. He showed the existence of an absolute 
bound a{d, m) for the number of real solutions to a system 

f{x,y) = 9{x,y) = 0, 

where / is a poljTiomial of degree d and g has m terms. The proof of this result, like his 
counterexample, has unfortunately been lost. This result however, was the inspiration for 
Khovanskii to develop his theory of fewnomials. 

Recently, Avendano pLj, established a precise version of a special case of Sevostyanov's 
theorem. 

Theorem 4.7 Suppose that f{x,y) is linear and g{x,y) has m terms. Then the system 

f{x,y) = g{x,y) = 0, 
has at most 6m — 4 real solutions. 

4.3 Systems supported on a circuit 

Restricting the analysis of Section [3.2. II to real solutions shows that X(0,n) = 1. Recently, 
it was shown that X{l,n) = 1+n. We discuss this here. 

A collection A of n+2 vectors in which affinely spans M" is called a circuit. The 
circuit is primitive if its Z-affine span is all of Z". When & A, this means that Z" = ZA. 
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Theorem 4.8 (Bertrand, Bihan, and Sottile |l9j) ^ polynomial system supported on a 
primitive circuit has at most 2n + 1 nondegenerate nonzero real solutions. 

Theorem 4.9 (Bihan |llj ) A polynomial system supported on a circuit has at most n+1 
nondegenerate positive solutions, and there exist systems supported on a circuit having n+1 
positive solutions. 

This can be used to construct fewnomial systems with relatively many positive solutions. 

Corollary 4.10 (Bihan, Rojas, Sottile |12] ) There exist systems of n polynomials in n 
variables having l+n+1 monomials and at least [j]' positive solutions. 

This gives a lower bound for X{1, n) of [y]', and is the best construction when / is fixed 
and n is large. It remains an open problem to give constructions with more solutions, or 
constructions with many solutions when / is not fixed. 

The construction establishing Corollary 14.101 is quite simple. Suppose that n = fc/ is a 
multiple of /, and let 

. . . ,Xfc) = /2(xi, . . . = ■■■ = fk{xi, . . . ,Xk) = 0, 

be a system with k+2 monomials and k+1 positive solutions. Such systems exist, by Theo- 
rem S31 Write F{x) = for this system and assume that one of its monomials is a constant. 
For each i = 1, let = {yi \ . . . , be a set of k variables. Then the system 

= = ... = = 0, 

has [k + 1)' solutions, kl variables, and 1 + I + kl monomials. When n = kl + r with r <;, 
adding extra variables yi and equations yi = 1 for i = 1, ... ,r gives a system with ([jj + 1)' 
positive solutions and 1 + I + n monomials. 

When I = 1 the fewnomial bound (14.31) becomes 

2C2") . + , 

which is considerably larger than Bihan's bound of n+1. Replacing l+n by n in the fewno- 
mial bound, it becomes equal to Bihan's bound when / = 1. When / = 0, this same 
substitution in (14. 3 p yields 1, which is the sharp bound when / = 0. In Chapter |5l we will 
outline generalizations of Theorems 14.81 and 14.91 to arbitrary /, giving the bound 

X{l,n) < £^2(2)n' (4.13) 
for positive solutions and, when A is primitive, a bound for all real solutions, 

4 

This is only slightly larger — the difference is in the exponents 2 and 4 of e. These are proven 
in [31 Hg. 
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By Corollary EO] and the bound fl413l) . 

X(n,l) < ^-±^2('^)n'. 

This reveals the correct asymptotic information for X{n, I), when / is fixed, X{n, I) = B(n'). 

Theorems 14.81 and 14.91 are related, and we will outline their proofs, following the papers 
in which they appear, where more details may be found. To begin, let 

fl{Xi,X2,...,Xn) = f2{Xi,X2,...,Xn) = ■■■ = /„ (xi , Xs, . . . , X^) = (4.14) 

be a system with support a circuit A. Suppose that E A and list the elements of the circuit 

}. After a multiplicative change of coordinates (if necessary), we may 
assume that Qq = where e„ is the nth standard basis vector. Since the system (I4.14p is 
generic, row operations on the equations put it into diagonal form 

x"-' = Wi + ViX^^ for 2 = 1, . . . ,n. (4.15) 

When A was a simplex we used integer linear algebra to reduce the equations to a very 
simple system in Section 13.2.11 We use (different) integer linear algebra to simplify this 
system supported on a circuit. 



n 



4.3.1 Some arithmetic for circuits 

Suppose that {0, oq, Oi, . . . , a„} C Z" is a primitive circuit. We assume here that it is 
nondegenerate — there is no affine dependency involving a subset. (The bounds in the de- 
generate case are lower, replacing n by the size of this smaller circuit.) After possibly making 
a coordinate change, we may assume that ao = i ■ Sn, where e„ is the nth standard basis 
vector. 

For each i, we may write ai = bi + ki ■ e„, where bi G Z"^^. Removing common factors 
from a nontrivial integer linear relation among the n vectors {bi, . . . , 6„} C Z"~^ gives us 
the primitive relation among them (which is well-defined up to multiplication by —1), 

p n 

^Ajfej = ^ Xibi. 

1=1 i=p+l 

Here, each Aj > 0, and we assume that the vectors are ordered so that the relation has this 
form. We further assume that 

n p 
i=p+l i=l 



Then we have 



and so 



i=l i=p+l 



p n 
i=l i=p+l 
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4.3.2 Elimination for circuits 



Using f l4.15p to substitute for x"' in (14. 16p gives the univariate consequence of ( 14.15P 

p n 

fM := x:!l[{w, + v,xy^ - U [w, + v^x'j'^ . (4.17) 

j=l i=p+l 

Some further arithmetic of circuits (which may be found in [9]) shows that f{xn) has degree 
equal to n!vol(A_4). This is in fact the ehminant of the system. 

Lemma 4.11 The association of a solution x of fl4.15p to its nth coordinate x„ gives a 
bijection between the solutions of fl4.15p and the roots of f f l4.17p which restricts to a bisection 
between their real solutions /roots. 

While / is the eliminant of the system, we do not have a Grobner basis or even a 
triangular system to witness this fact, and the proof proceeds by exphcitly constructing a 
solution fl4.15p from a root x„ of /. 

The upper bound is found by writing / = F — G as in f l4.17p and then perturbing /, 

ft{y) = t-F{y) - G{y). 

We simply estimate the number of changes in the the real roots of ft as t passes from — oo to 
1, which can occur only at the singular roots of ft- While similar to the proof of Khovanskii's 
theorem, this is not inductive, but relies on the form of the Wronskian F'G — G'F whose 
roots are the singular roots of ft- This may also be seen as an application of Rolle's Theorem. 
We note that this estimation also uses Viro's construction for t near and oo. 

These estimates prove the bounds in Theorems 14.81 and 14.91 Sharpness comes from con- 
struction. In [9] Viro's method for univariate polynomials is used to construct polynomials / 
are constructed with 2n+l real solutions, for special primitive circuits. Bihan [11] constructs 
a system with n+1 positive solutions using Grothendieck's dessins d'enfants. 

4.3.3 A family of systems with a sharp bound 

We give a family of systems that illustrate the result of Theorem 14. 81 (actually of an extension 
of it) and which may be treated by hand. These systems come from a family of polytopes 
A C for which we prove a nontrivial upper bound on the number of real solutions to 
polynomial systems with primitive support A := A fl Z". That is, the integer points ^ in A 
affinely span Z", so that general systems supported on A have ?t,!vo1(A) complex solutions, 
but there are fewer than r;,!vol(A) real solutions to polynomial systems with support A. This 
is intended to not only give a glimpse of the more general results in [9], but also possible 
extensions which are not treated in [T3] . 

Let / > A; > and n > 3 be integers and e = (ei, . . . , e„_i) G {0, 1}""^ have at least one 
nonzero coordinate. The polytope A^ ^ C M" is the convex hull of the points 

(0,...,0), (1,0,. ..,0),...,(0,.. .,0,1,0), (0,...,0,/c), (ei,...,e„_i,/). 
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The configuration Al i = ^ fl also includes the points along the last axis 

(0,...,0,1), (0,...,0,2), (0,...,0,A;-1). 

These points include the standard basis and the origin, so i is primitive in that ZA = 11^ . 

Set |e| := e^. Then the volume of A|. ^ is (/+A;|e|)/n!. Indeed, the configuration A^^ ^ can 
be triangulated into two simplices A|, ^ \ {(ei, . . . , e„_i, /)} and A^. ^ \ {0} with volumes kjnX 
and (/ — /c + A;|e|)/n!, respectively. One way to see this is to apply the affine transformation 



n-l 



{X\ , . . . , Xn) 



i=l 



Theorem 4.12 The number, r, of real solutions to a generic system of n real polynomials 
with support A], i lies in the interval 

< r < k + k\e\ +2, 

and every number in this interval with the same parity as I + k\e\ occurs. 

This upper bound does not depend on I and, since k < I, it is smaller than or equal to 
the number / + A;|e| of complex solutions. We use elimination to prove this result. 



Example 4.13 Suppose that n = k = 3, I = 5, and e 

Then the system 

X + y + xyz^ + 1 + z + + = 

x + 2y + 3xyz^ + 5 + 7z + llz'^ + 13z^ = 

2x + 2y + xyz^ + 4 + 8^ + 16z'^ + 32z^ = 

is equivalent to 

X- (5 + 11^ + 23^2 + 4123) = 

y + (8 + 18^ + 38^2 + 72^3) = 

xyz^ ~{2 + Qz + Uz"^ + 30z^) = 



(1,1)- 
















xyz 



^3,5 



And thus its number of real roots equals the number of real roots of 

z^{5 + llz + 23^2 + Alz^){S + 182 + 38^2 + 722^) - (2 + 62 + lAz^ + 30^^) , 
which, as we invite the reader to check, is 3. 



Proof of Theorem 4-12 A generic real polynomial system with support Ali has the form 



n-l 



CijXj + CinX^X^ + fi{Xn) = for 2 = 1 , . . . , ?T, . 

J = l 
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where each polynomial fi has degree k and x'^ is the monomial xl^ ■ ■ ■ x^T/ • 

Since all solutions to our system are simple, we may perturb the coefficient matrix 
{cij)^j^i if necessary and then use Gaussian elimination to obtain an equivalent system 

Xl-gi{Xn) = ■■■ = Xn-l - gn-l{Xn) = x'x^ - = 0, (4.18) 

where each polynomial Qi has degree k. Using the ffist n — 1 polynomials to eliminate the 
variables Xi, . . . ,Xn-i gives the univariate polynomial 

Xn ■ giiXnY' ■ ■ ■ gn-liXnY"-' " 9n{Xn) , (4.19) 

which has degree / + fc|e| = f (A^ ;). Any zero of this polynomial leads to a solution of the 
original system (14.181) by back substitution. This implies that the number of real roots of 
the polynomial (I4.19P is equal to the number of real solutions to our original system (I4.18p . 

The eliminant (I4.19P has no terms of degree m for k < m < I, and so it has at most 
k + k\e\ + 2 nonzero real roots, by Descartes's rule of signs. This proves the upper bound. 

We omit the construction which shows that this bound is sharp. 
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Chapter 5: 

Fewnomial upper bounds from Gale 
dual polynomial systems 

This needs a proper Introduction 

Suppose that we have the system of polynomials, 

v'^w^ — lluvw'^ — 33uv^w + Av'^w + Ibu^v + 7 = 0, 

v'^w^ + 5uv^w-4v'^w- 3u'^v + l = 0, (5.1) 

v'^w'-^ - lluvw^ - Sluv'^w + 2v'^w + Uu'^v + 8 = 0. 

If we solve them for the monomials v'^w^, v'^w, and uvw^, we obtain 

v'^w^ = 1 — u^v — uv'^w , 

v'^w = I — u^v + uv^w , (5.2) 
uvw^ = ^{1 + u^v — Suv'^w) . 

Since 

(uv^wY ■ (f^w^) = u^v^w^ = (u'^v) ■ (y^wY ■ [uvw^) and 

\u v) ■ \y w ) = u V w = [uv w) ■ [v w) ■ [uvw ) , 

we may substitute the expressions on the right hand sides of (15.21) for the monomials v'^w^, 
v'^w, and uvw^ in these expressions to obtain the system 

(uv'^wY ■ (l — u'^v — uv'^w) = (u'^v) ■ (I — u'^v + uv^wY ■ (yy(1 + ""^"^ ~ Suv'^w)) and 
(^u^vY ■ (l — u^v — uv'^wy' = [uv'^wY ■ (I — u'^v + uv^vo^ • (yy(1 + u^v — 3uv'^w)Y ■ 
Writing x for u'^v and y for uv^w and solving for 0, these become 

/ := y'il-x-y) - x(i - x + y)^ (if (1 + x - 3y)) = 0, and 

g ,= ^2(i_^_^)3 _ ^2(1 + (12(1 + ^_3^))2 = 0. 

Figure |5TT] shows the curves these define and the lines given by the linear factors in (15. 3p . 

It is clear that the solutions to (15. 3p in the complement of the lines are consequences of 
solutions to (15. ip . More, however, it true. The two systems define isomorphic schemes as 
complex or as real varieties, with the positive solutions to (15. ip corresponding to the solutions 
of (15. Sp lying in the central pentagon. Gale duality, which generalizes this isomorphism, is 
a first step towards the new fewnomial bounds of [3l |T3] . 

We remark that these new bounds are derived using the general method that Khovanskii 
developed in [65]. However, they take advantage of special geometry (encoded in Gale 
duality) available to systems of polynomials in a way that the proof of Khovanski's bound 
(Theorem 14. ip did not. Their main value is that they are sharp, in the asymptotic sense 
described after Corollary 14.101 
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Figure 5.1: Curves and lines 

5.1 Gale duality for polynomial systems 

Gale duality is an alternative way to view a sparse system of polynomials. It was developed 
in [11] in more generality than we will treat here. Let us work over the complex numbers. 
Let v4 = {0, Oi, . . . , ai+n} C be integer vectors which span M" and suppose that we have 
a system 

/l(x„, . . . = f2{Xi, . . . ,Xn) = ■■■ = fn{Xi, . . . ,Xn) = (5.4) 

of polynomials with support A. As in Section 13.1.11 the solutions may be interpreted geo- 
metrically as <y9^^(L), where is the map 

9?^ : T" — > T'+" C C'+" 

' f I t/j ^ tij ^ • • • ^ iXj J ^ 

and L C C'"^"" is a codimension n plane defined by degree 1 polynomials Ai, . . . , A„ cor- 
responding to the polynomials /j. While we work here in C'"+', but used F-^ = P"+' in 
Section I3.1.H there is no essential difference. 

Suppose that A is primitive in that ZA = Z", so that the homomorphism <y9_4 is injective. 
Then the subscheme of T" defined by (15. 4 p is isomorphic to the subscheme X := (^(T") fl L 
of T'"'""' or C'"*"". If we change our perspective and view X as the basic object, then the 
parametrization (y9_4 of (/)^(T") realizes X (or rather ip^^(X)) as the subscheme of T" defined 
by dMD. 

The main idea behind Gale duality for polynomial systems is to instead parameterize L 
with a map V'p: C' — )■ L and then consider the subscheme ipp^{X) of C', which is isomorphic 
to X. This is in fact what we did in transforming (15. ip into (15. Sp . We will show that ip'^lX) 
is defined in C' by a system of master functions, which we define in the next section. 
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5.1.1 Master functions 



Let Pi{y), . . . ,Pi+n{y) be pairwise non-proportional degree 1 polynomials on C'. Their prod- 
uct riiPid/) — defines a hyperplane arrang ement H in CK Let /3 G be an integer 
vector, called a weight for the arrangement "H. We use this to define a rational function p^, 

p" = p{yY ■■= pi(y)V(y)'^---w+n(z/)''+", 

where (3 = {bi, . . . , bi+n)- This rational function p{y)'^ is a master function for the arrange- 
ment "H. As the components of /3 may be negative, its natural domain of definition is the 
complement M-^ := \ 7/ of the arrangement. 

A system of master functions in with weights B = . . . , /?;} is the system of 
equations in M^, 

P{y)^' = P{yf' = ... = p{yf^ = 1 . (5.5) 

More generally, we could instead consider equations of the form p{y)^ = a, where a G T 
is an arbitrary nonzero complex number. We may however absorb such constants into the 
polynomials Pi{y), as there are l+n such polynomials but only / constants in a system of 
master functions. This may be viewed as the source of the factor in (15. 3p . We further 
assume that the system (15. 5p defines a zero- dimensional scheme in M^. This implies that 
the weights B are linearly independent, and that the suppressed constants multiplying the 
Pi{y) are sufficiently general. 

As with sparse systems, a system of master functions may be realized geometrically 
through an appropriate map. The polynomials Pi{y), ■ ■ ■ ,Pi+n{y) define an affine map 

: — > C'+" 

y I — > {pi{y),P2{y),...,Pi+n{y)) ■ 

This map is injective if and only if the polynomials {l,pi{y), . . . ,pi+niy)} span the space of 
degree 1 polynomials on C', in which case the hyperplane arrangement T-L is called essential. 
The hyperplane arrangement "H is the puUback along ipp of the coordinate hyperplanes Zi = 
in C'"*""", and its complement M-^ is the pullback of the torus T'^" which is the complement 
of the coordinate hyperplanes in C'~^". 

The weights B are saturated if they are linearly independent and span a saturated sub- 
group of Z'^"^, that is, if 7jB = QB fl Z'"*"*^. Linear independence of B is equivalent to the 
subgroup G of the torus T'"*"" defined by the equations 

= z^^ = ... = z^' = 1 (5.6) 

having dimension n and saturation is equivalent to G being connected. (Here, zi, . . . , zi^n 
are the coordinates for C'^".) In this way, we see that the equations (15. 5p describe the 
pullback ip~^{G) of this subgroup G. We summarize this discussion. 

Proposition 5.1 A system of master functions (15. 5 p in M-^ is the pullback along ipp of the 
intersection of the linear space ippi^}) with a subgroup G of of dimension n, and any 
such pullback defines a system of master functions in M-^. When ifjp is injective, it gives a 
scheme-theoretic isomorphism between the solutions of the system of master functions and 
the intersection G fl ipp{£}). 
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5.1.2 Gale duality 

Proposition 15.11 is the new ingredient needed for the notion of Gale duahty. Suppose that 
G C T'"*"" is a connected subgroup of dimension n and that L C C'"*""" is a hnear subspace 
of dimension / not parallel to any coordinate hyperplane. Then the intersection G fl L has 
dimension 0. 

Definition 5.2 Suppose that we are given 

1. An isomorphism y9_4 : — )• G for A = {0,ai, . . . , a;+„} and equations fl5.6p defining G 
as a subgroup of T'+". Necessarily A is primitive and B = {f3i, . . . , is saturated. 

2. A linear isomorphism ^/'p : C' — )■ L and degree 1 polynomials Ai, . . . , A„ on C'"*"" defin- 
ing L. 

Let "H C C' be the pullback of the coordinate hyperplanes of C'^" along ipp. We say that 
the system of sparse polynomials on T" 

¥P*(Ai) = ^*iA2) = ■■■ = ¥;*(A0 = (5.7) 

with support A is Gale dual to the system of master functions 

piy)^' = Piy)"' = ■■■ = p{y)^' = 1 (5.8) 

with weights B on the hyperplane complement M-^ and vice-versa. 

The following is immediate. 
Theorem 5.3 A pair of Gale dual systems ( 15. 7p and ( 15. Sp define isomorphic schemes. 

This notion of Gale duality involves two different linear algebraic dualities in the sense 
of linear functions annihilating vector spaces. In the first, the degree 1 polynomials Pi{y) 
defining the map ipp are annihilated by the degree 1 polynomials Aj which define the system 
of sparse polynomials ( 15. 7p . The second is integral, as the weights B form a basis for the 
free abelian group of integer linear relations among the nonzero elements of A. Writing 
the elements of B as the rows of a matrix, the l+n columns form the Gale dual or Gale 
transform [171 §5.4] of the vector configuration A — this is the source of our terminology. 

If we restrict the domain of to the real numbers or to the positive reals, then we 
obtain the two forms of Gale duality which are relevant to us. Set := M \ {0}, the 
real torus and M> to be the positive real numbers. Suppose that A is not necessarily 
primitive, but that the lattice index [Z" : "LA] is odd. Then : — TJ^" is injective. 
Similarly, if ZB has odd index in its saturation QB fl Z'"*"", which is the group of integer 
linear relations holding on A, then the equations (15. 6p define a not necessarily connected 
subgroup G C T'"*"" whose real points Gr lie in its connected component containing the 
identity. When the linear polynomials Aj of ( 15. 7p and pi of ( 15. 8 p are real and annihilate 
each other, then these two systems — which do not necessarily define isomorphic schemes in 
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and M-^ — define isomorphic real analytic sets in for (15.7P and in tlie complement 
:= \ "Hm of the real hyperplanes defined by the pi for f lS.Sp . 

In the version valid for the positive real numbers, we may suppose that the exponents 
A are real vectors, for if r G ]R> and a G M, then := exp(alog(x)) is well-defined. In 
this case, the weights B should be a basis for the vector space of linear relations holding 
on A, and the degree 1 polynomials Aj and Pi are again real and dual to each other. The 
equations 15.61 for z G M'.^" define a connected subgroup of Mf^" which equals (pj\^{W^). In 
this generality, the polynomial system f l5.7p makes sense only for x G and the system of 
master functions (15.81) only makes sense for y in the positive chamber Ap of the hyperplane 
complement M^, 

Ap := {y eR^ \p^iy)>0 z = 1, . . . , l+n} , 
and the two systems define isomorphic real analytic sets in M" for (15. 7p and in Ap for (15. 8p . 



5.1.3 Algebra of Gale duality 

The description of Gale duality in Definition 15.21 lends itself immediately to an algorithm 
for converting a system of sparse polynomials into an equivalent system of master functions. 
We describe this over C, but it works equally well over M or over M>. Suppose that ^ C 
is a primitive collection of integer vectors and suppose that 

fl{Xi, . . . , Xn) = /2(Xi, . . . = ■■■ = f^{xi,...,Xn) = (5.9) 

defines a zero dimensional subscheme of T". In particular, the polynomials fi are linearly 
independent. We may solve these equations for some of the monomials to obtain 

= 9i{^) ='■ . . . , x'*'+™+") 

; ; (5.10) 

= 9n{x) =: p„(x'^"+^, . . . , a;"'+™+") 

Here, for each i = 1, . . . , n, gi{x) is a polynomial with support {0, a„+i, . . . , ai^n} which 
is a degree 1 polynomial function pj(x""+^, . . . , x°'+") in the given / arguments. For i = 
n+l, . . . , l+n, set pj(x'^"+S . . . , x"'+") := 

An integer linear relation among the exponent vectors in A, 

biai + 62^2 + ■ ■ ■ + k+nai+n = , 
is equivalent to the monomial identity 

(x"i)^^ ■ (x"2)''^---(x"'+")'"+" = 1, 
which gives the consequence of the system (I5.10p 

(pi(x""+\...,x"'+"))'^ ■■■ (pi+„(x»"+\...,x"'+"))''+" = 1. 

Define yi, ... ,yi to he new variables which are coordinates for C'. The degree 1 polyno- 
mials pi{yi, . . . ,yi) define a hyperplane arrang ement H in C'. Let B := . . . , A} C Z'+" 
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be a basis for the Z-module of integer linear relations among the nonzero vectors in A. These 
weights B define a system of master functions 

p{yf' = p{yf' = ... = p{yf^ = 1 (5.11) 

in the complement of the arrangement. 

Theorem 5.4 The system of polynomials (15. 9 p in and the system of master func- 
tions (15. lip in Mfi define isomorphic schemes. 

Proof. Condition (1) in Definition 15.21 holds as A and B are both primitive and annihilate 
each other. The linear forms Aj that pull back along y9_4 to define the system (I5.10p are 

= ~ Pi{^n+1: • • • ; ^/+n) ; 

which shows that condition (2) holds, and so the statement follows from Theorem 15.31 

The example at the beginning of this chapter illustrated Gale duality, but the equa- 
tions (15. 3 p are not of the form p^ = 1. They are, however, easily transformed into such 
equations, and we obtain 

y^{l-x-y) x'^{l-x-yf ^ 



x{\-x + yf (if (1 + X - 3y)) ^2(1 - a; + (if (1 + X - ^y)f 

Systems of the form (15. 3 p may be obtained from systems of master functions by multiplying 
= 1 by the terms of p^ with negative exponents to clear the denominators, to obtain 
pP* = pP and thus p^'^ —p^ =0, where I5± is the componentwise maximum of the vectors 
(0,±/3). 

5.2 New fewnomial bounds 

The transformation of Gale duality is the key step in establishing the new fewnomial bounds. 

Theorem 5.5 A system (15. 4p of n polynomials in n variables having a total of Z+n+1 
monomials with exponents A C M" has at most 

4 

positive nondegenerate solutions. 

If A G and ZA has odd index in Z", then the system has at most 

e'^ + ^r,(') I 
2y2Jn 

4 

nondegenerate real solutions. 
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The first bound is proven in [13] and the second in pj. By Gale duahty, Theorem 15.51 is 
equivalent to the nest Theorem. 

Theorem 5.6 Let pi{y) , . . . , pi+n{y) be degree 1 polynomials on M! that, with 1, span the 
space of degree 1 polynomials. For any linearly independent vectors B = . . . , /?/} C 
the number of solutions to 

1 = p{yf' for j = l,...,l 

in the positive chamber Ap is less than 

4 

If B G Z'"*"" and has odd index in its saturation, then the number of solutions in is less 
than 

4 

Should replace the hypotheses on the degree 1 polynomials by the notion of essential hyper- 
plane arrangement, and have that arise in the discussion of Gale duality. 

We outline the proof of Theorem 15.61 in the next three sections. 

5.2.1 One idea 

The basic idea behind the proof of Theorem 15.61 is to use the Khovanskii-RoUe Theorem, 
but in a slightly different form than given in Theorem 14.31 Using it in this way to establish 
bounds for real solutions to equations was first done in [13]. Given functions gi,...,gm 
defined on a domain D, let V{gi, . . . , gm) be their set of common zeroes. If C is a curve in 
let ubc(C) be its number of unbounded components in D. 

Theorem 5.7 (Khovanskii-Rolle) Let gi, . . . ,gi be smooth functions defined on a domain 
D gM! with finitely many common zeroes and suppose that C := V{gi, . . . ,gi-.i) is a smooth 
curve. Set J to be the Jacobian determinant, det{dgi/dyj) , of gi, . . . ,gi. Then we have 

\V{gi,...,gi)\ < nhc{C) + \V{gi,...,gi.,,J)\. (5.13) 

This form of the Khovanskii-Rolle Theorem follows from the from the usual Rolle The- 
orem. Suppose that gi{a) = gi{b) = 0, for points a, b on the same component of C. Let s{t) 
be the arclength along this component of C, measured from a point to G C, and consider 
the map, 

C — y R'^ 
t ^ {s{t),gi{t)). 

This is the graph of a different iable function g{s) which vanishes when s = s{a) and s = s{b), 
so there is a point s{b) between s{a) and s{b) where its derivative aso vanishes, by the usual 
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RoUe Theorem. But then c hes between a and b on that component of C, and the vanishing 
of g'{s{c)) is equivalent to the Jacobian determinant J vanishing at c. 

Thus along any arc of C connecting two zeroes of gi, the Jacobian vanishes at least once. 




As in the proof of Theorem 14.31 the estimate of (15.131) follows from the observation 
concerning consecutive zeroes of gi along C. 

5.2.2 A generalization and two reductions 

We first make an adjustment to the system of master functions in Theorem 15. 6[ replacing 
each master function p{y)^ = Pi{yY^ ■ ■ ■ Pi+niv)^'''*'" by 

\piy)f ■■=\Piiy)\''---\Pi+n{y)\'"^-. 

For example, if we take absolute values in the system of master functions ( I5.12p . we obtain 
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\y\ 
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X + y\ 




X - 3y)|2 



= 1. (5.14) 



This new system with absolute values will still have the same number of solutions in the 
positive chamber Ap as the original system, since \pi{y)\ = Pi{y) for i = l,...,/+n and 
y G Ap. Its solutions in the hyperplane complement will include the solutions to the 
system of master functions from Theorem 15. 6^ but there may be more solutions. 

We illustrate this for the system f l5.14p in Figure \572\ which we may compare to Figure EI] 
as the system of master functions ( I5.12p is equivalent to the system (15. 3p in the complement 
of the lines. In particular, among the solutions to the system (I5.14p are the three solutions 
to (15. 3 p in the positive chamber (which is shaded) as well as the three solutions to (15. 3p 
outside the positive chamber. The system (I5.14p has four additional solutions outside the 
positive chamber, which are marked in Figure 15.21 

We give a proof of Theorem 15.61 for systems of the form \p{y)\^^ = 1 for j = 1, . . . , /. This 
will imply the bound for systems of master functions. Taking absolute values allows nonin- 
tegral (real number) exponents in \p{y)\^^ = 1, and so we need not require that exponents 
are integral. 

We promised two reductions. 
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Figure 5.2: Curves from absolute values 



1. The degree 1 polynomials Pi{y) are in general position in that the hyperplanes in the 
arrangement "H are in linear general position. That is, any j of them meet in an affine 
linear subspace of codimension j, if j < /, and their intersection is empty if j > /. We 
may do this, as we are bounding nondegenerate solutions, which cannot be destroyed 
if the Pi{y) are perturbed to put the hyperplanes into this general position. 

2. Let B be the matrix whose rows are . . . We may assume that every minor 
of B is non-zero. This may be done by perturbing the real-number exponents in the 
functions \p{y)\^^- This will not reduce the number of nondegenerate solutions. 

Perturbing exponents is not as drastic of a measure as it first seems. Note that in 
the hyperplane complement, \p{y)\'^ = 1 defines the same set as \og{\p{y)\'^) = 0. If 
/3 = [bi, . . . , bi+n), then this is simply 



Expressing the equations in this form shows that we may perturb the exponents. 

We first look at these reductions in the context of the system of master functions fl5.12p . 
The hyperplane arrangement H is an arrangement of lines in which no three meet and no 
two are parallel, and thus they are in general position. The matrix of exponents is 



No entry and no minor of B vanishes. 

Let us now see how the Khovanskii-RoUe Theorem applies to the system fl5.14p of Fig- 
ure restricted to the positive chamber. First, take logarithms and rearrange to obtain 

2 log |a;| — 2 log + 3 log |1— yl — log ||— — 2 log I yy(1+3:— 3y)| = 
log |x| — Slog |?/| + 1 log + Slog + log |YY(l+a^—3?/)| = 



6llog|pi(?/)| + 62log|p2(y)| + ■■■ + 6/+nlog|p/+n(2/)| 



0. 



(5.15) 
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Call these functions fi and /2, respectively. Their Jacobian is the rational function 

2x^ - IQx'^y + 12xy^ + 6y^ - + 26xy - f + fx + - 2 
xy{l — X — y){^—x+y){l + x — 3y) 

whose denominator is the product of the linear factors defining the lines in Figure 15.21 
Clearing the denominator and multiplying by 2, we obtain a cubic polynomial 

J2 := Ax^ - 32x^y + 24y^x + 12y^ - 31x^ + 52xy - 53y^ + 9x + 15y - 4 . 

Its zero set meets the curve Ci (which is defined by fi = 0) in 6 points, five of which we 
display in Figure [573] — the sixth is at (3.69, —0.77). By the Khovanskii-RoUe Theorem, the 




Figure 5.3: Gale system and Jacobian J2. 

number of solutions to /i = /2 = is at most this intersection number, \V{fi, J2)|, plus the 
number of unbounded components of Ci. 

We see that Ci has 14 unbounded components, which gives the inequality 

10 = \V{hj2)\ < ubc(Ci) + |r(/i,J2)| = 14 + 6 = 18. 
5.2.3 Three lemmata 

We follow the suggestion in the second reduction in Section 15.2.21 and replace the master 
functions by the logarithms of their absolute values. For each j = 1, . . . ,1, define 

l+n 

9j{y) ■■= iogb(2/)|^^ = J]Ajiogbi(2/)| , 

i=l 
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where is the ith component of f3j. Observe that both fj = and \p{y)\'^^ = 1 have 
the same solutions in the hyperplane complement M^. For each j = 1, . . . , / , define fij := 

Recall that the connected components of the complement are called chambers. A 
Hat of the arrangement TL is an affine subspace which is an intersection of some hyperplanes 
in "H. By our assumption that the hyperplane arrangement "H is in general position, a fiat 
of "H has codimension j exactly when it is the intersection of j hyperplanes in "H. 

Lemma 5.8 For each j = 1, . . . , l—l, fij is a smooth submanifold of of codimension j . 
The closure JIJ of fij in M} meets the arrangements in a union of codimension j +1 flats. In 
the neighborhood of point on a codimension j+1 flat meeting Jlj, fij has at most one branch 
in each chamber incident on that point. 

Define functions Ji, Ji-i, . . . , Ji by recursion, 

Jj := Jacobian oi gi, . . . , gj, Jj+i, . . . , Ji . 

The following is proven in [T3| § 3.1] using the Cauchy-Binet Theorem. 

l+n 2i-i 

Lemma 5.9 Jj ■ ( j is a polynomial of degree 2^~^ ■ n. 

i=l 

For each j = 1, . . . , /, define Cj := fl V{Jj+i, . . . , Ji). By our assumptions that 
the polynomials gi and the exponents Pj are general, this will be a smooth curve in the 
hyperplane complement M^. 

We now iterate the Khovanskii-RoUe Theorem 15.71 to estimate the number of solutions 
to a system of master functions as in Theorem 15. 6[ 

\Vigi,g2,---,gi)\ < nhc{Ci) + \V{gi, g2, . . . , gi-i, Ji)\ 

< ubc(Q) + ubc(Q_i) + \V{g,,...,gi.2, Ji-i, Ji)\ (5.16) 

< ubc(Q) + ■■■ + ubc(Ci) + |r(Ji, J2,..., Jz)|. 

Here, V{- ■ ■) is the common zeroes in the hyperplane complement M^. Let ubcA(C') count 
the number of unbounded components of the curve C in Ap and Va(- ■ ■ ) be the common 
zeroes in Ap. Then the analog of fl5.16p holds in Ap. 

Lemma 5.10 With these definitions, we have the estimates 

(1) \vM,...,Ji)\ < \v{j,,...,Ji)\ < 2(2)^'. 

(2) ubcA(C,) < i(i+5+").2('^On'-^-. 

(3) ubc(C,) < |(^+'.+") ■2Mn'-^ -2^-. 



61 



The first statement follows from Lemma 15.91 and Bezout's Theorem. For the second, 
nore that Cj = fl V{Jj+i, . . . , J/). Since each unbounded component has two ends, 
we estimate the number of unbounded components by | of the number of points in the 
boundary of A that are limits of points of Cj. By Lemma [5.8[ these will be points in the 
codimension skeleta where Jj+i, ■ ■ ■ , Ji vanish. The bound in the second statement of 15.101 

is simply | multiplied by the product of [^'^''j^"') and 2^ ^^^n^~K That is by the number of 
codimension J fiats in "H (some of which meet the boundary of A) multiplied by the Bezout 
number of the system J,_|_i = ■ ■ ■ = J; = 0. 

Note that the bound in (2) holds for any chamber of M^. We get the bound in (3) by 
noting that in the neighborhood of any codimension j stratum of the complement has at 
most 2^ chambers, and so each point of Cj at the boundary of can contribute at most 
2^ such ends. 

The complement of the hyperplane arrangement consists of many chambers. The 
first bound of Theorem 15.61 is in fact a bound for the number of solutions in any chamber, 
while the second bound is for the number of solutions in all chambers. This is smaller than 
what one may naively expect. The number of chambers in a generic arrangement of 1+n+l 
hypersurfaces in M' is 

r2+n+l\ , (2+n+l\ , , f (^^2 ^0 +1 if 1 + n + / is CVCU 

I I ; + I /-2 ; + ■ ■ ■ + I l + n + l if 1 + n + / is odd ' 

Thus, we would naively expect that the ratio between ubcA(C'j) and ubc(Cj) to be this 
number, rather than the far smaller 2K This is the source for the mild difference between 
the two estimates in Theorem 15. 5[ 

We may combine the estimates of Lemma [5.101 with (15.161) to estimate \Vi^{Ji, . . . , Ji)\ 
and \V{Ji,...,Ji)l 

|V^a(Ji,...,JOI < ^^C+;+")-2(V)n'-. + 2a)n', and 

i=i 

\V{J,,...,Ji)\ < i^(^+;+-)-2Mn'-^-.2^- + 2(2)n'. 
It is not hard to show the estimate [131 Eq.(3.4)] 

, ) - ji 

so that these estimates become 

\VM,---,Ji)\ < f|E;=i^ +1)2(2)^' < ^2(2)^', and 

\v{Ju...,Ji)\ < (ie;=i^ +1)2(2)^' < ^2av. 

This implies Theorem 15.61 and thus the new fewnomial bounds. Theorem 15.51 
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Chapter 6: 

Lower Bounds for Sparse Polynomial 
Systems 

In Chapter [H we mentioned how work of Welschinger |131j , Mikhalkin [79] , and of Khar- 
lamov, Itenberg, and Shustin [561 EZ] combined to show that there is a nontrivial lower bound 
Wd for the number of real rational curves of degree d interpolating 3c/— 1 points in MP^. If 
Nd is the Kontsevich number f ll.6p of complex rational curves of degree d interpolating 3d — l 
points in CP^, then we have 

1. Wd > J, and 

2. limp^ = 1. (Infact, log(iVrf)~3dlog(ci)~log(W^,).) 

d-^oo log(Wrf) 

An exposition of the beginning of this story written for a general mathematical audience 
is given in [113] . Similar results were found by Solomon [103] for rational curves on real 
Calabi-Yau threefolds. For example, there are at least 15 real lines on a smooth quintic 
hyperurface in P^. 

Eremenko and Gabrielov [33] have a similar result for the number of real solutions to 
the inverse Wronski problem. They gave numbers am,p > for < m < p with m+p odd, 
and proved that if $ is a real polynomial of degree mp then there are at least am,p different 
m-dimensional subspaces of the form Span{/i, f2, ■ ■ ■ , fm} where each /j is a real polynomial 
of degree m+p— 1, and their Wronskian Wr (/i, /2, . . . , fm) is a scalar multiple of $. We will 
discuss this in Section 16.51 

While these results are spectacular, they are but the beginning of what we believe will 
be a bigger and more important story (at least for the applications of mathematics). These 
are examples of natural geometric problems possessing a lower bound on their numbers 
of real solutions. It would be a very important development for some applications if this 
phenomenon were widespread, if there were methods to detect when such lower bounds 
existed, and if there were also methods to compute or estimate these lower bounds. The 
point is that nontrivial lower bounds imply the existence of real solutions to systems of 
equations, or to interesting geometric problems. A beginning of the interaction between 
applications and this new theory of lower bounds is found in work of Fiedler-Le Touze [36] 
and discussed in Section [L3] of Chapter [H 

This chapter will report on the first steps toward a theory of lower bounds for sparse 
polynomial systems as given in [105] . There are three papers where one may read more 
about this subject. 

[33] A. Eremenko and A. Gabrielov, Degrees of real Wronski maps, Discrete Comput. 
Geom. 28 (2002), no. 3, 331-347. 
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• Establishes a lower bound for the Wronski map, realized as a degree of its lift to 
oriented double covers. 



|105j E. Soprunova and F. Sottile, Lower bounds for real solutions to sparse polynomial 
systems, Advances in Math., 204 (2006), no. 1, 116-151. 

• Begins the theory of lower bounds to sparse polynomial systems. 

[60] M. Joswig and N. Witte, Products of foldable triangulations, Advances in Math., 210 



• Uses geometric combinatorics to give many more examples of sparse polynomial 
systems with a lower bound on their number of real solutions. 



The last two papers study lower bounds for unmixed systems, such as those covered by 
Kouchnirenko's Theorem. It remains an important open problem to develop a theory for 
unmixed systems such as those which appear in Bernstein's Theorem.''' Example 16.71 at the 
end of this chapter is a first step in this direction. 

6.1 Polynomial systems from posets 

Let P be a finite partially ordered set (posct), whose elements we take to be our variables. 
For convenience, assume that P has n elements. The order ideals / C P are its up sets — 
subsets that are closed upwards. Specifically, a subset / C P is an order ideal if whenever 
X < y in P with x G /, then we must have y ^ L Our polynomials will have monomials 
indexed by these order ideals. For J C P set 



A linear extension w of P is a permutation w: Xi,X2, ■ ■ ■ ,Xn of the elements of P such that 
Xi < Xj in P implies that i < j. Let A(P) denote the number of linear extensions of a poset 
P. The sign- imbalance o"(P) of P is 



the sum over all linear extensions w of P. Here, sgn(w) is the sign of the permutation w. (To 
be precise, this makes sense only if we first fix one linear extension vr of P and measure the 
sign of W7r~^. Taking the absolute value in (16. ip removes the effect of that choice.) Lastly, 
a Wronski polynomial for P is a polynomial of the form 



(2007), no. 2, 769-796. 




(6.1) 




(6.2) 



^Put a statement like this in the chapter on Gale duality! 
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the sum over all order ideals I of P, where cq, ci, . . . , c„ G M. Note that the coefficient of the 
monomial depends only upon its degree, |/|. The reason for the term Wronski polynomial 
is that this comes from a linear projection with a special form that is also shared by the 
Wronski map. (This is discussed just before Section [8.2.11 in Chapter [81) 

Theorem 6.1 Suppose that P is a finite poset in which every maximal chain has the same 
parity. Then a generic system of Wronski polynomials for P has A(P) complex solutions, at 
least cr{P) of which will be real. 

Remark 6.2 A variant of this is to ffist fix real numbers aj for each order ideal / and then 
consider polynomials of the form 

''^^c\i\aix^ . (6.3) 
I 

That is, the coefficients cq, ci, . . . , c„ vary in W^, but the numbers aj are fixed. The same 
statement holds about such systems. In fact, this variant is closest to the Wronski map in 
Schubert calculus. '^X^ 



Example 6.3 Let P be the incomparable union of two chains, each of length 2, 




Here are the monomials corresponding to the order ideals of P. 

{0, X, z, wx, xz, yz, xyz, wxz, wxyz} . 

A Wronski polynomial for P has the form 

C4 wxyz 
+ c-i{xyz + wxz) 
+ C2{wx + XZ + yz) (6.4) 
+ ci(x + z) 

+ Co, 

where the coefficients cq, . . . , C4 are real numbers. 

There are six linear extensions of P, as each is a permutation of the word wxyz where 
w precedes x and y precedes z. The sign-imbalance of P is seen to be 2, as computed in the 
following table. 



permutation 


wxyz 


wyxz 


ywxz 


wyzx 


ywzx 


yzwx 


aiP) 


sign 


+ 




+ 


+ 




+ 


2 



Since P has two maximal chains and each contains 2 elements, it satisfies the hypotheses 
of Theorem 16. 1[ and so we conclude that: 
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A system of four equations involving polynomials of the 
form (16 ■4p has six solutions, at least two of which are real. 

The rest of this chapter will discuss the steps in the proof of Theorem 16.11 the more 
general results contained in |105] . and some examples that illustrate this phenomenon of 
gaps. 

6.2 Orientability of real toric varieties 

Recall from Chapter [3] that a sparse polynomial system with support ^ C is equivalent 
to a linear section fl L of the toric variety X^. In fact, its solutions are 

where ip_A : T"^ 3 x ^-^ [x"" \ a E A] E P"^ is the parametrization map and X^ is the closure of 
its image. 

Issue 1. In order for there to be a bijective correspondence between real solutions of the 
original system and real points in the linear section, we need that the map be injective 
on (M^)", which is the condition that the index [ZA: Z"] be odd. This should be regarded 
as a minor issue and a mild assumption. We have already seen this in Chapters [3] and El 

The key idea at the beginning of this theory is to realize the intersection X_4 fl L as the 
fiber of a map. To that end, let -E C L be a hyperplane in L that does not meet X_4 and 
M ~ P" a linear space that is disjoint from E. Then E has codimension n+1 and the set of 
codimension n planes containing E is naturally identified with M ~ P", as each codimension 
n plane containing E meets M in a unique point. Define the linear projection 

TT = nE : - E — ^ M ~ P'^ (6.5) 

by sending a point p E P"^ — E to the intersection of M with span of E and p. Figure 16.11 

illustrates this in P^, where E and M are lines. We sometimes write tt: P-^ > P", using 

the broken arrow > to indicate that the map is not defined on all of P"^. 

Write TT for the restriction of the linear projection to the toric variety X_4. U y := L H M 
is the point where L meets M, then 

X^nL = n-\y). 

This is also illustrated in Figure 16. H where Xyi is a cubic curve. The reason that we 
reformulate our system of polynomials as the fiber of a map is so that we may use topological 
methods to study its solutions. 

That is, we work over M, define := X^ fl MP-^, and consider the real linear section 

n LiR. Equivalently, we restrict the projection vr further to a map 

/ : F4 ^ MP", 

and consider points in the fiber f~^{p), where p E MP". Since and MP" have the 
same dimension, the map / may have a topological degree. For this, and MP" must be 
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Figure 6.1: A linear projection vr with center E. 



orientible and we must fix orientations of and of MP'^. Then, for every regular regular 
value y G MP" of /, the differential map df is a bijection on the tangent spaces of every 
point X G f~^{y)- Define sign(x) := 1 if the differential preserves the orientations at x and 
y and sign(x) := — 1 if the orientations are reversed. The topological degree of / is the sum 

^ sign(x) . 

This definition does not depend upon the choice of a regular value y, as MP" is connected.''' 
The value of the notion of topological degree is the following.^ 

Theorem 6.4 The number of points in a fiber f~^{y), for y G MP" a regular value of f , is 
at least the absolute value of the topological degree of f . 

Since both MP" and are not always orientable, the topological degree of / is not 
always defined, and so we do not get a bound as in Theorem 16.41 To remedy this, we 
consider also the lift of the linear projection vr to the double covers coming from spheres: 

f+: Y+ C - -- S" 

11^1 (6.6) 

/ : ^4 C MP^ > MP" 

Then the topological degree of the map provides a lower bound for the number of 
solutions, as in Theorem 16. 4[ 'We seek criteria which imply that the map / or its lift 
f~^ to the double covers has a well-defined degree. That is, either or is orientable. 

For this, we use Cox's construction of as a quotient of a torus acting on affine space, 
as detailed in [2ll Theorem 2.1]. (In the symplectic category, this is due to Delzant.) The 

^Givc a reference for this. 

^Reconcile this with the use of degree in Chapter 2. 
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convex hull of A is the polytope A := Ayi. We need the dual description of A in terms of 
intersections of half-spaces, or facet inequalities, which have the form 



A = {zeW I B-z > -6} , 

where B is an integer by n matrix (N is the number of facets of A) whose rows are the 
inward-pointing normals to the facets of A, and b G measures the (signed) lattice distance 
of each facet from the origin. Then there is a natural parametrization : — X_4, 
realizing X_4 as a geometric invariant theory quotient of by a subtorus of T^. (We do 
not give the details here, see [21] or |105] .) 

Issue 2. While ip/\ restricts to a map z/^a : — — ^ (and also to Y^), these maps are 
not necessarily surjective. This map ipA is surjective if and only if the column span col{B) 
of the matrix B has odd index in its saturation, fl (Q (S>z col(i?)). 

When the standard orientation of drops to an orientation either of or of under 
the Cox quotient map V'a, we say that is Cox-ohentable. 

Theorem 6.5 With the assumptions outlined in Issue 1 and Issue 2, if co\{B) + 'L-h contains 
a vector, all of whose coordinates are odd, then is Cox-orientahle. If co\{B) has such a 
vector, then Y4 receives the orientation, otherwise Y^ receives the orientatation. 

Remarks. 

1. Given a projection map vr fl6.5p whose center E is disjoint from Y^, write / for its 
restriction to Y4. Suppose that Y4 is Cox-orientable. Then lift /"*" : Y^ — S*" of / to 
Y^ has a well-defined degree, which is a lower bound for the number of real solutions 
to polynomial systems arising as fibers of the map /. Call this number the real degree 
of the map /. 

2. If P is a poset and A C {0, 1}^ consists of the indicator functions of its order ideals, 
so that a Wronski polynomial for P (16.21) has support A, then it is not hard to show 
that "LA = Z" and also Y4 is Cox-orientable if all maximal chains of P have the same 
parity. 

This parity condition is sufficient, but not necessary, for the Wronski polynomial sys- 
tem on Y4 to have a lower bound. The poset 

I 



I I 

I I ' 
• • 



has two maximal chains of lengths 3 and 4, and so it is not necessarily Cox-oriented. 
Nevertheless, its Wronski polynomial systems have (3) = 35 solutions, at least (^) = 3 
of which are real. See Section 16.41 
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6.3 Degree from foldable triangulations 



These results provide us with a challenge: compute the real degree of a (or any) map / 
arising as a linear projection of a Cox-orientable toric variety Y^. We give a methods that 
uses the toric degenerations of Chapter [3] to provide an answer to this question. It is by no 
means the only answer. 

By Corollary 13. 5[ if A consists of the integer points in a lattice polytope A which has a 
regular unimodular triangulation A,^, then there exists a flat deformation of within the 
projective space P-^ into a limit scheme which is the union of coordinate n-planes 

limt.X. = 1 I P^ 

The idea here is to find conditions on the triangulation that allow us to compute the real 
degree of some map / from this deformation. Some triangulations A^; have a naturally 
defined linear projection with a nicely defined degree. 

Example 6.6 Consider the triangulation of the hexagon (the HSBC Bank symbol rotated 
45° anti-clockwise) shown below with the vertices labeled a, b, and c. Mapping these vertices 
to the corresponding vertices of the simplex defines a piecewise-linear 'folding' map tt^ whose 
degree is 4 — 2 = 2. 

b a 

a jj ^Xr 

Suppose now that A = A fl Z", where A is a lattice polytope. A triangulation A^ is 
foldable if its facet simplices may be properly 2-colored, which is equivalent to there being 
a coloring of the vertices A of the triangulation with n + 1 labels, where each simplex 
receives all n+1 labels |59]. Both the 2-coloring and the vertex-labeling are unique up to 
permuting the colors and labels. The difference in the number of simplices of different colors 
is the signature o"(cu') of the balanced triangulation. Up to a sign, it is the degree of the 
combinatorial folding map from A to an n-simplex given by the labeling of A. 

The vertex labels define a linear projection tt^^ : P-^ )■ P". Let be the standard basis 

vector of P-^ corresponding to the monomial a ^ A and Bj for j = 0, 1, . . . , n the standard 
basis vector of P". We assume that the labels in the balanced triangulation take values 
0,1, ... ,n. Then TT^{ea) = ej, where the vertex a of has label j. The restriction of this 
linear projection to the coordinate spheres ti^j : 5"^ — )■ 5" is the geometric counterpart of 
this combinatorial folding map. 

From the geometry of this map, we can show that for t > and small, the restriction / 
of TT^ to t.Y^ has degree a{u). This can be deduced from the real version of the arguments 
used in the algorithmic proof of Koushnirenko's Theorem in Lecture 2. In fact. Figure [3?T] 
from Lecture 2 is reproduced from this argument in |105] . expand on this?? 
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The restrictions of ir^ to t.Y^ for t near zero and t = 1 are homotopic, if t.Yji^DkeT^TT^) = 
0, for t G (0, 1). This condition is not hard to check on specific examples. 

6.4 Reprise: polynomial systems from posets 

Suppose now that P is a poset and A C {0, l}"^ is the set of indicator functions of order 
ideals / of P, so that a Wronski polynomial for P has support A. Then Ap := conv(^) is 
the order polytope of P. This was studied by Stanley |116] . Its facet inequalities are 



Give more details in these brief paragraphs It has a unimodular triangulation whose 
simplices correspond to linear extensions of P, and it is easy to see that it is regular. 
Unimodular implies that ZA = Z". Recently, Stanley |117] and others introduced the 
notion of sign- (im) balance of posets, (t{P), and it is tautological that cr{uj) = cr(P). It is 
also easy to use the description of the facet inequalities to show that col(i?) is saturated, 
and if every maximal chain in P has the same parity, then is Cox-oriented. This explains 



Discuss the phenomenon of gaps. 

6.5 Sagbi degenerations 

This method of computing a degree through a limiting process applies to the original result 
of Eremenko and Gabrielov on the degree of the Wronski map in Schubert calculus [33] . The 
Grassmannian Gr(p, m+p) of p-planes in (m-l-p)-space admits a sagbi degeneration to the 
toric variety associated to the poset Pp^m which is the product of a chain with p elements 
and a chain with m elements, and one may show that the degree of the Wronski map for 
the Grassmannian Gr(p, m+p) is equal to the degree of the Wronski map / (the restriction 
of 71^) on Y^, where A is the set of monomials coming from order ideals of Pp^m- (For this, 
we use a more general version of the Wronski projection tt^^ which corresponds to the more 
general Wronski polynomial of (16. 3p .) 

Using the sagbi degeneration, we can recover the results of Eremenko and Gabrielov. 
In fact, this connection between the Grassmannian and its toric degeneration, as well as 
between Eremenko and Gabrielov's formula for the degree of the Wronski map, the sign- 
imbalance of Pp^m, and the geometric folding map was the genesis of this work |105] . 

6.6 Open problems 

There is much more to be done in this area. Here are some suggestions. 




ii y G P is mininmal 
if y G P is maximal 
if y < X is a cover in P 



Theorem 16.11 
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1. Give more comprehensive conditions which imply that /, or its hft to some (not nec- 
essarily the one given above) double cover of is orientable. 

2. Find other methods to give polynomial systems whose degree may be computed or 
estimated. 

3. Find more balanced triangulations (see |60j). 

4. Apply these ideas to specific problems from the applied sciences. 

5. Extend any of this from unmixed systems (all polynomials have the same Newton 
polytope) to more general mixed systems (those whose polynomials have different 
Newton polytopes). We end with an example in this direction which is due to Chris 
Miliar. 

Example 6.7 Let P and Q be the two lattice polytopes given below 




A polynomial with support P has the form 

g := A{x) + yB{X) , 

where A and B are univariate polynomials in x with degree m. Suppose that their coefficients 
are qq, . . . ,am and Bq, . . . ,bm, and let h be the polynomial with support Q, 

h := c + dxy + ex^y + fxy"^ . 

By Bernstein's Theorem (Theorem ll.2p . the mixed system g{x,y) = h{x,y) = will have 
2m + 2 solutions in (C^)^, as 2m + 2 is the mixed volume of P and Q. (For polygons P, Q, 
the mixed volume is vol(P + Q) — vol(P) — vol{Q).) We can compute an eliminant for this 
system by substituting —A{x) for yB{x) in h ■ B{x)^, to obtain 

cB{xf - dxA{x)B{ x) — ex A[x)B{x) + fxA[x) . 

This has constant term cb^ and leading term —eambm- If ce > and ambm > 0, then these 
have different signs, which implies that the mixed system has at least one positive root 
(and hence at least two real roots). This may be ensured by the condition that none of the 
coefficients vanish and the two linear equations, c + e = am + bm = 0. '^X^ 
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Chapter 7: 

Enumerative Real Algebraic 
Geometry 

Enumerative geometry is the art of counting geometric figures satisfying conditions imposed 
by other, fixed, geometric figures. For example, in 1848, Steiner |118j asked how many plane 
conies are tangent to five given conies? His answer, 6^ = 7776, turned out to be incorrect, and 
in 1864 Chasles [20] gave the correct answer of 3264. These methods were later systematized 
and used to great effect by Schubert [97j, who codified the field of enumerative geometry. 

This classical work always concerned complex figures. It was only in 1984 that the 
question of reality was posed by Fulton [HI p. 55]: "The question of how many solutions of 
real equations can be real is still very much open, particularly for enumerative problems." 
He goes on to ask: "For example, how many of the 3264 conies tangent to five general conies 
can be real?" He later determined that all can be real, but did not publish that result. 
Ronga, Tognoli, and Vust [90] later gave a careful argument that all 3264 can be real. 

Since this work of Ronga, Tognoli, and Vust, there have been many geometric problems 
for which it was shown that all solutions may be real. This means that the upper bound of 
d (= number of complex solutions) is sharp for these problems. In this chapter, we describe 
some of these problems, beginning with the problem of conies, and concentrating on the 
Schubert calculus. A survey of these questions (circa 2002) is given in |112j . 

7.1 3264 real conies 

The basic idea of the arguments of Fulton and of Ronga, Tognoli, and Vust is to deform the 
same special configuration. We will sketch the idea in Fulton's construction. 

Suppose that are the lines supporting the edges of a convex pentagon and 

Pi & £i, i = 1, . . . ,5 are points in the interior of the corresponding edge. 




The points in this example are {(0, 0), (^, |), (|, |), (— ^), (—1, 1)}, and the corresponding 
slopes of the lines are 0, |, — 1, ^, — 3. 



72 



For every subset S of the lines, there are 2™™''^''^''^"''^'^' conies that are tangent to the 
hnes in S and that meet the 5— IS"! points not on the hnes of S. This is the number of 
complex conies, and it does not depend upon the configuration of (generic) points and lines. 
However, when the points and lines are chosen in convex position, then all such conies will 
be real. Altogether, this gives 

2°© + 2^ (J) + 2^g) + 2^© + 2^(3 + 2°© = 102 

real conies, that, for each i = 1, . . . , 5 either meet Pi or are tangent to ii. We draw these in 
Figure 17.11 Since our pentagon was asymmetric, exactly 51 of these conies meet each point 




Figure 7.1: 102 conies 

Pi and none of the 51 conies tangent to ii are tangent at Pi. 

The idea now is to replace each pair pi G £j by a hyperbola hi that is close to the pair 
Pi G ii, in that hi lies close to its asymptotes, which are two lines close to ii that meet at Pi. 
If we do this for one pair pi & ii, then, for every conic in our configuration, there will be two 
nearby conies tangent to hi. To see this, suppose that i = 1. Then the set C of conies which 
satisfy one of the conditions "meet pj" or "tangent to ij" for each j = 2, 3, 4, 5 will form an 
irreducible curve C. For each conic in C that meets pi, there will be two nearby conies in 
C tangent to hi near pi, and for each conic in C tangent to ii, there will be two nearby 
conies in C tangent to each of the two nearby branches of hi. We illustrate this when C is 
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the curve of conies tangent to £2,^3,^4, and £5, showing the conies in C, 




and then a closeup near li. 

Conies in C 
meeting pi 



Nearby 
conies 



Conic in C Nearby 
tangent to ii conies 




For the configuration of 102 conies of figure I7.H the hyperbola 

hi : {y - j-^x){y + ix) + ^ = 0, 



is sufficiently close to its asymptotes, which meet at pi and are sufficiently close to £1, and 
so each of our 102 conies that meet pi or are tangent to £1 becomes two conies tangent to 
hi. We first show the configuration of 102 conies of Figure [7?T] in the neighborhood of £1, 
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and then the resulting 204 conies in the same region. 




The key to the existence of this construction was that no tangent direction to a conic through 
Pi met hi, which is possible as no conic was tangent to ii at pi. Figure 1772] shows the resulting 
204 conies that are tangent to hi and, for each i = 2, 3, 4, 5 either contain pi or are tangent 
to ii. 




Figure 7.2: 204 conies 

If we now replace p2 G £2 by a similar nearby hyperbola, then the 204 conies become 
408. Replacing p^ G £3 by a nearby hyperbola, will give 816 conies. Continuing with p^ G £4 
gives 1632, and finally replacing p^ G £5 with a hyperbola gives five hyperbolae, hi, ... ^h^ 
for which there are 2^ ■ 102 = 3264 real conies tangent to each hi. In this way, the classical 
problem of 3264 conies can have all of its solutions be real. Observe that this discussion also 
gives a derivation of the number 3264 without reference to intersection theory [39] . 
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7.2 Some geometric problems 

We discuss some other geometric problems that can have all their solutions be real. 
7.2.1 The Stewart-Gough platform 

The position of a rigid body in has 6 degrees of freedom (3 rotations and 3 translations). 
This is exploited in robotics, giving rise to the Stewart-Gough platform [l6l 1119] : Suppose 
that we have 6 fixed points Ai, A2, ■ ■ ■ , Aq in space and 6 points Bi, B2, ■ ■ ■ , Bq on a rigid 
body B (the framework of Figure [7731) . The body is controlled by varying each distance /j 




Figure 7.3: A Stewart platform. 

between the fixed point Ai and the point Bi on B. This may be accomplished by attaching 
rigid actuators between spherical joints located at the points Ai and Bi, or by suspending 
the platform from a ceiling with cables. (Apparently, this configuration is often used in 
factories.) 

Given a position of the body B, the distances /i, I2, ■ ■ ■ ,Iq are uniquely determined. A 
fundamental problem is the inverse question (in robotics, this is called the forward problem): 
Given a platform (positions of the Ai fixed and the relative positions of the Bi specified) 
and a sextuple of distances li,l2, ■ ■ ■ Je, what is the position of the platform? 
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It had long been understood that several positions were possible for a given sextuple of 
lengths. This led to the following enumerative problem. 

For a given (or general) Stewart platform, how many (complex) positions are there 
for a generic choice of the distances Zi, I2, . . . , Ze? How many of these can be real? 

In the early 1990's, several approaches (including a nice interaction between theory and 
computer experimentation [721191]) showed that there are 40 complex positions of a general 
Stewart platform. The obviously practical question of how many positions could be real 
remained open until 1998, when Dietmaier introduced a novel method involving numerical 
homotopy to find a platform and value of the distances li,l2, ■ ■ ■ ,h with all 40 positions real. 

Theorem 7.1 (Dietmaier |27] ) All 40 positions can be real. 
7.2.2 Real rational cubics through 8 points in P| 

Recall from Section [TT^ that there are 12 singular (rational) cubic curves containing 8 general 
points in the plane. Kharlamov studied this over the real numbers. 

Theorem 7.2 ( |23|, Proposition 4.7.3]) Given 8 general points in P^, there are at least 
8 real rational cubics containing them, and there are choices of the 8 points for which all 12 
rational cubics are real. 

The bound is a nice exercise in Euler characteristic. A homogeneous cubic has 10 coeffi- 
cients, so the set of plane cubics is naturally identified with 9-dimensional projective space. 
Let pi, . . . ,ps be general points in MP^. As the condition for a cubic to contain a point pi is 
linear in the coefficients of the cubic, there is a pencil (P^) of cubics through these 8 points. 
Let P, Q be two distinct cubics in this pencil, which is then parametrized by sP + tQ for 
[s,t] G P^. By Bezout's Theorem, the cubics P and Q, and hence every cubic in the pencil, 
vanish at a ninth point, pg. 

It is not hard to see that there is a unique cubic in the pencil that vanishes at any point 
p G MP^ — {pi, . . . ,^9}. A little harder, but still true, is that there is a unique cubic in the 
pencil with any given tangent direction at some point Pi. In this way, we have maps 

Z := Bl|p^^...^pg}MP^ 
C\ Jvr 

Mpi MP2 

where Bl{p^^...^pg}MP^ is the blow-up of MP^ in the 9 points, which is obtained by removing 
each point pi and replacing it with the tangent directions MP^ ~ 5*^ to MP^ at p,,. The map 
TT is the blow-down, and the map C associates a point of Z to the unique curve in the pencil 
which contains that point. 

Because Z is a blow-up, we may compute its Euler characteristic to obtain 

X{Z) = x(KP') - 9 ■ x(pt) + 9 ■ x{S') 
= 1-9 + 
= -8. 
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The key to Theorem 17.21 is to compute the Euler characteristic of Z a second way using the 
map C : Z — )■ S^. The fibers of this map are the cubic curves in the penciL Smooth real 
cubics either have one or two topological components, 




and hence are homeomorphic to one or two copies of S^. In either case, their Euler character- 
istic is zero, and so the Euler characteristic of Z, —8, is the sum of the Euler characteriatics 
of singular fibers of the map C. 

Because the points pi, . . . ,ps were general, the only possible singular fibers are nodal 
cubics, and there are two types of real nodal cubics. 




The first is the topological join of two circles and has Euler characteristic —1, while the 
second is the disjoint union of a circle with a point and therefore has Euler characteristic 1. 
Thus, if n is the number of nodal cubics in the pencil and s is the number with a solitary 
point, then we have n + s < 12 (as there are 12 complex rational cubics), and s — n = —8. 

There are three solutions to this system, (n, s) G {(8, 0), (9, 1), (10, 2)}. Thus there are 
at least 8 real cubics through the 8 points, and this is the derivation of the Welschinger 
number = 8. Moreover, if there are 2 cubics in the pencil with solitary points, then all 
12 rational cubics will be real. Such a pencil is generated by the two cubics given below, 
whose curves we also display. 




We conclude that there will be 12 real rational cubics interpolating any subset of 8 of the 9 
points where these two cubics meet. 

The question of how many of the A^^ (11.61) rational curves of degree d which interpolate 
3d—l points in MP^ can be real remains open. (This was posed in |1U6] .) 
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7.2.3 Common tangent lines to 2n— 2 spheres in MJ^ 

How many common tangent lines are there to 2n—2 spheres in M"? For example, when 
n = 3, how many common tangent lines are there to four spheres in R^? (The number 2n— 2 
is the dimension of the space of lines in M" and is necessary for there to be finitely many 
common tangents.) Despite its simplicity, this question does not seem to have been asked 
classically, but rather arose in computational geometry. Macdonald, Pach, and Theobald [77] 
gave an elementary argument that four spheres with the same radius in can have at most 
12 common tangents. Then they considered the symmetric configuration where the spheres 
are centered at the vertices of a regular tetrahedron. If the spheres overlap pairwise, but 
no three have a common point, then there will be exactly 12 common real tangents, as 
illustrated in Figure EH 




Figure 7.4: Four spheres with 12 common tangents. 



The general case was established soon after |114j . 

Theorem 7.3 2n — 2 general spheres in M" (n > 3) have 3 ■ 2""^ complex common tangent 
lines, and there are 2n — 2 such spheres with all common tangent lines real. 

The same elementary arguments of Macdonald, Pach, and Theobald give a bound valid 
for all n and for spheres of any radius, and a generalization of the symmetric configuration 
of Figure 17.41 gives a configuration of 2n — 2 spheres having 3 ■ 2"~^ common real tangents. 

Megyesi [78] showed that this result for n = 3 remains true if the spheres have coplanar 
centers (Figure [73]) . but that there can only be 8 common real tangents (out of 12 complex 
ones) if the spheres have the same radii (Figure [7l6|) . 
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Figure 7.5: Four spheres with coplanar centers and 12 common tangents. 




Figure 7.6: Four equal spheres with coplanar centers and 8 common tangents. 

The spheres in Figures 17.41 and 17.51 are not disjoint, in fact their union is connected. 
Fulton asked if it were possible for 4 disjoint spheres to have 12 common real tangents. It 
turns out that a perturbation of the configuration of Figure 1731 gives four pairwise disjoint 
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spheres with 12 common tangents, as we show below. 




The three large spheres have radius 4/5 and are centered at the vertices of an equilateral 
triangle of side length v^, while the smaller sphere has radius 1/4 and is centered on the 
axis of symmetry of the triangle, but at a distance of 35/100 from the plane of the triangle. 
It remains an open question whether it is possible for four disjoint unit spheres to have 12 
common tangents. 

7.3 Schubert calculus 

The largest class of problems which have been studied from the perspective of having all 
solutions real come from the classical Schubert calculus of enumerative geometry, which 
involves linear spaces meeting other linear spaces. The simplest nontrivial example illustrates 
some of the vivid geometry behind this class of problems. Consider the following question: 

How many line transversals are there to four given lines in space? 

To answer this, first consider three lines. They lie on a unique hyperboloid. (See Fig- 
ure [7]71) This hyperboloid has two rulings by lines. The three lines are in one ruling, and 
the other ruling (which is drawn on the hyperboloid in Figure [7T|) consists of the lines which 
meet the three given lines. 

The fourth line will meet the hyperboloid in two points (the hyperboloid is defined by 
a quadratic polynomial). Through each point of intersection there will be one line in the 
second family, and that line will meet our four given lines. In this way, we see that the 
answer to the question is 2. Note that the fourth line may be drawn so that it meets the 
hyperboloid in two real points, and both solution lines will be real when this happens. 

Let Gr(p, m+p) be the Grassmannian of p-dimensional linear subspaces (p-planes) in 
an (m+p)-dimensional vector space, which is an algebraic manifold of dimension mp. The 
Schubert calculus involves fairly general incidence conditions imposed on p-planes H. These 
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Figure 7.7: Hyperboloid containing three lines. 



general conditions are imposed by flags, which are sequences of linear subspaces, one con- 
tained in the next. More specifically, a fiag F, is a sequence 

: -^1 C F2 C ■ • ■ C Fm+p^i C Fm+p , 

where Fi is a linear subspace having dimension i. 

These general conditions are called Schubert conditions and are indexed by sequences 
a: 1 < ai < a2 < ■ ■ ■ < Op < m+p of integers. Write (''"^^') for the set of all such 
sequences. The set of all p-planes satisfying the condition a imposed by the fiag F, is a 
Schubert variety, defined by 

X„F. := {H e Gr(p, m+p) \ dimH (1 F^^ > j for j = 1, . . . ,p} . 

This is a subvariety of the Grassmannian of dimension |a| := ai — 1 + a2 — 2 + ■ ■ ■ + ap — p. 
Its CO dimension is mp — 

If is a linear subspace of codimension p and F, is a fiag with F^ = K, then 

XaF, := X(^m,m+2,...,m+p)F, = {H I HnK^{Q}}. 

The condition that H meets K is is called a simple Schubert condition and XqF, is a simple 
Schubert variety. More generally, a special Schubert variety of codimension a is 

XaF, := X 

{m+l—a,m+2,...,m+p) 

F, = {H \ Hn 7^{0}}. 

Theorem 7.4 Given Schubert conditions a\ a^, . . . , a" m ('™^^') wt/i l"^*!) — "^P' 

there exist real flags F^,F^, . . . , F^ such that the intersection 

n 

f]X^.F: (7.1) 

i=l 

is transverse with all points real. 
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The numerical condition that Ylii'^P ~ 1*^*1) — imphes that the expected dimension 
of the intersection is zero. If the flags are in general position, then the Kleiman-Bertini 
Theorem [67] implies that the intersection is in fact transverse and zero-dimensional, so 
there will be finitely many complex p-planes satisfying the incidence conditions a* imposed 
by general flags, and these p-planes are exactly the points in the intersection. 

Theorem [73] was proved in several stages. First, when p = 2 |107j . and then for any p, but 
only for special Schubert conditions [108j . and then finally for general Schubert conditions 
by Vakil [T27] - 

7.4 Quantum Schubert calculus 

Given points Si, S2, . . . , Sd{m+p)+mp e and m-planes Ki,K2,..., Kd{m+p)+mp in C™+p, there 
are finitely many rational curves 7 : — > Gr(p, m+p) of degree d so that 

7(si) n Ki ^ {0} i = l,2,...,d{m+p) + mp. (7.2) 

These are simple quantum Schubert conditions. More generally, one could (but we will 
not) impose the condition that the p-plane 7(si) lie in some predetermined Schubert variety. 
The number of solutions to such problems are certain Gromov-Witten invariants of the 
Grassmannian, and may be computed by the quantum Schubert calculus [H [511 1102i 1126] . 

Theorem 7.5 ( [109] ) There exist real points Si, S2, ■ ■ ■ , Sct(m+p)+mp G MP^ and real m- 
planes Ki,K2, . . . , Kd(m+p)+mp in K™^^ so that every rational curve 7: P^ — t- Gi{p,m+P) 
of degree d satisfying (17. 2p is real. 

7.5 Theorem of Mukhin, Tarasov, and Varchenko 

In May of 1995, Boris Shapiro communicated to the author a remarkable conjecture that he 
and his brother Michael had made concerning reality in the Schubert calculus. They conjec- 
tured that there would only be real points in a zero-dimensional intersection of Schubert va- 
rieties given by fiags osculating the rational normal curve. Subsequent computation [ ^IllU] 
gave strong evidence for the conjecture and revealed that the intersection should be trans- 
verse. Partial results were obtained |108[ [M]. and the full conjecture was proven by Mukhin, 
Tarasov, and Varchenko [81]. They later gave a second proof [82], which different from their 
original proof and gave a proof of transversality. 

This Shapiro Conjecture has been a motivating conjecture for the study of reality in the 
Schubert calculus with several interesting (and as-yet-unproven) generalizations that we will 
discuss in subsequent chapters. Let 7 be the rational normal (or moment) curve in C'"^^, 
which we will take to be the image of the map 

7(t) = (1, t, t\ r+P-i) G C^+P, 

defined for t G C. Given a point t G C, the osculating Hag F,{t) is the fiag of subspaces 
whose i-plane is the linear span of the first i derivatives of 7, evaluated at t 

F,{t) := span{7(t), 7'{t), 7"(t), j^'-'\t)} . 
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This flag makes sense for t G P . Here is tlie strongest form of the Shapiro conjecture that 
has been proven |82j. 

Theorem 7.6 Ifa^, a^, . . . , a" G {^"^p^^) are Schubert conditions satisfying '^i{'mp—\a^\) = 
mp, then, for every choice of n distinct points Si, S2, . . . , s„ G MP^, the intersection 

n 

1=1 

is transverse with all points real. 

It is instructive to consider this for the problem of two lines that we saw in Section 17.31 
An osculating line will be a tangent line to the rational normal curve 7. The Shapiro 
Conjecture asserts that given four lines that are tangent to the rational normal curve at real 
points, there will be two lines meeting all four, and the two lines will be real. As before, 
first consider three lines tangent to the rational normal curve. They will lie in one ruling of 
a quadric surface, and the other ruling consists of the lines meeting all four. 

For example, if we let ■y(t) = (6t^ — 1, jt^ + |t, —^t^ + ft), and consider tangent lines 
£(-1), £{0), and to 7, 

^(±1) : (5,±5, 1) + m(±1,1,0) £(0) : (-1, 0, 0) + m(0, 1, 1) uER, 

then the quadric is x"^ — y"^ + z"^ = 1. We display this in Figure [721 together with the rational 
normal curve. Since any three real points on any real rational normal curve may be carried 




Figure 7.8: Quadric containing three lines tangent to the rational normal curve. 

to any three real points on any other real rational normal curve by an automorphism of 
projective 3-space, we can assume that we are in the situation of Figure 17.81 

The rational normal curve 7 (a cubic) meets the quadric tangentially at the points where 
it is tangent to the lines. Since the total multiplicity of its intersection with the quadric 
is six, these are its only points of contact. In particular, 7 always lies on one side of the 
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quadric, looping around on the inside. In this way we can see that a fourth tangent hne 
to the rational normal curve must meet the quadric in two real points, giving the two real 
transversals asserted by Shapiro's Conjecture. 

It is no loss to assume that the fourth tangent line is tangent at some point ^{v) on the 
arc between points 7(— 1) and 7(0). As illustrated in Figure fL9\ the tangent line i{v) does 
indeed meet the hyperboloid in two real points. 




Figure 7.9: Configuration in the throat of quadric. 
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Chapter 8: 

The Shapiro Conjecture for 
Grassmannians 

In Chapter [TJ we considered the rational normal curve 7 in C*""^^, which we took to be the 
image of the map 

7(t) = (1, t, . . . , r+P-i) G C^+P , 

defined for t G C. For a point t G C, the osculating fiag F,{t) is the fiag of subspaces whose 
i-plane is the linear span of 7(t) and the the first i—1 derivatives of 7, evaluated at t, 

F,{t) := span{7(t), YW, l"{t). 7^^"'^^)}- 

We may also define F,{oo) to be the limit as s — )■ of to get a family of fiags F,{t) 

for t G 

We work in the Grassmannian Gr(p, m+p) of p-planes in C™"*"^. Recall from Section [7^ 
that Schubert varieties are indexed by Schubert conditions a G which are integer 

sequences a: 1 < ai < a2 < ■ ■ ■ < oip < m+p. Given a fiag F,, the corresponding Schubert 
variety is 

XaF, = {HeGY{p,m+p) I dimHnFa^yj for j = l,...,p}. 

This has dimension \a\ := ~ J)- 

We investigate the Shapiro conjecture for Grassmannians (Theorem of Mukhin, Tarasov, 
and Varchenko). 

Theorem 17.61 /fa^. a^. . . . ,a"' are Schubert conditions with Ylii''^P~ ~ ''^P' then, for 
every choice of n distinct points Si, S2, . . . , s„ G MP^, the intersection 

n 

f]X^.F.{s,) 

1=1 

is transverse with all points real. 

In particular, we show that a special case of this theorem is equivalent to the statement 
(Theorem 11.91) of the Shapiro conjecture from the Introduction, and then prove this special 
case in an asymptotic sense. 

8.1 The Wronski map 

When all the Schubert conditions are simple (so that a = □ = m, m + 2, . . . ,m+p and 
\a\ = mp — 1), the Shapiro conjecture has another formulation in terms of the Wronski 
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map. The Wronskian of a list fi(t), /2(t), . . . , fm(t) of polynomials of degree m+p—1 is the 
determinant 



Wr(/i,/2,...,/™,) := det( (-j f,{t) 

i,j=l,...,m 

which is a polynomial of degree mp, when the polynomials fi{t), f2{t), . . . , fm{t) are generic 
among all polynomials of degree m+p—1. 

Up to a scalar factor, this Wronskian depends only upon the linear span of the polyno- 
mials flit), f2{t), . . . , fmit). Removing these ambiguities, we get the Wronski map 

Wr : Gr(m,m+p) — y P™^ , (8.1) 

where Gr(m,p) is the Grassmannian of m-planes in the space of polynomials of degree 
m+p—1, and P*"^ is the space of polynomials of degree mp, modulo scalars. 

8.1.1 Some linear algebra 

Let us begin with the moment (rational normal) curve. For t E C, set 

7(t) = (l,t,t2,...,r+»'-^) e C^+f . 

Let T = T{t): C" C™+p be the map such that 

r(e,) = ^^'-'\t), (8.2) 

the (i— l)-th derivative of 7. (We will take ei, 62, . . . to be the standard basis vectors of the 
vector space in which we are working.) 

A polynomial / corresponds to a linear form (also written /): 

/ . c^+p ^ c so that / o 7(t) = f{t) . 
The matrix in the definition of the Wronskian is the matrix of the composition 

where the columns of are the linear forms defining the polynomials /i, /2, • • • , /m- Now 
suppose that H is a basis for the kernel of the map We consider if to be a map 
CP ^ C^+P, which we sum with r(t) to get a map 

This map is invertible if and only if the composition (18. 3p is invertible. Thus, up to a 
constant, we have 

Wr(/i,/2,...,/J = det[r(t) : H] , (8.4) 

as both are polynomials of the same degree with the same roots. (Strictly speaking, we 
need the Wronskian to have distinct roots for this argument. The general case follows via a 
limiting argument.) 
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We obtain a useful formula for the Wronskian when we expand the determinant ( 18.4^ 
along the columns of r{t) 



WT{h,h,...,fm) = det[r(t) : H] = Y,i-^PPc^int))-Pa4H). (8.5) 

a 

Here, the sum is over all a G (''"^^'), which are choices {ai < a2 < ■ ■ ■ < am} C [m+p] of 
m distinct rows of the matrix T{t). Also, a'^ = [m+p] — a are the complimentary rows of H, 
and Pa{^{t)) is the ath maximal minor of T[t), which is the determinant of the submatrix 
of T{t) formed by the rows in a, and similarly for pa<^{H). Observe that a'^ G (''"^1'^'). 

There is a similar expansion for the Wronskian using the composition (18.31) . Take the 
top exterior power (A™) of this composition, 

where we have used the ordered basis of C™ so that A^C" = C • ei A e2 A . . . A e^, which 
identifies A"^C"^ with C. Then A"^T{t) is simply a vector in /\^C^+p and A'"\E' is a covector 
for A™C"+P. If we use the basis e« := A e^^ A . . . A e„„ for a^C^+p, where a G ('"+^^) , 
then we see that the Wronskian has the form 

Wr(A,/2,...,/J = Yl Paim)-Pam. 

Here, pa(r{t)) and Pai"^) are the ath coordinates of the corresponding vector /covector 
(which are the ath maximal minors of the corresponding matrices). Equating these two 
expressions for the Wronskian gives the equality (again up to a constant) 

E i-ipPcim) ■ pah) = Yl P'^im)-PaW. 

This argument does not use much about the matrix T{t), besides that it depends upon 
an indeterminate t. Replacing r(t) by a matrix of indeterminates proves an interesting (and 
well-known) matrix identity. 

Proposition 8.1 Suppose that \E' and H are matrices of format n by k and {n — k) by n 

respectively such that \E' o = (the image of H is the kernel of \E'j. Then there is a 
constant C so that 

p^i^) = C-i-lpp^H) , 

for allae (tj). 



8.1.2 Connection to Schubert calculus 

We explore some geometric consequences of the determinantal formulas (18. 4 p and (18. 5p . 
Let H be the column space of the matrix H, which is the kernel of the map \E'. Then H 
is a point in the Grassmannian Gr(p, m+p). From the definition (18. 2 p of F, we see that 
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the column space of the matrix r{t) is the m-plane Fm{t) osculating the rational normal 
curve 7 at the point ■jit). From (18 .4^ and f l8.5p . we see that s is a zero of the Wronskian 
:= Wr f2{t),..., fm{t)) if and only if 







det[r(s) : H]. 



This implies that there is a linear dependence among the columns of this matrix and thus 
there is a nontrivial intersection between the subspaces Fm{s) and H. 

Suppose that a polynomial has distinct zeroes Si, S2, . . . , Smp- Then the columns of 
the matrix H are linear forms cutting out the linear span of polynomials /2(t), . . . , fm{^) 
of degree m+p—1 having Wronskian <l>(t) 

■<=^ the p-plane H meets the m-plane Fm{si) nontrivially for each 2 = 1,2,..., mp, 

^ H lies in the Schubert variety X^F,{si) for each i = 1, 2, . . . , mp. 

If the roots si, . . . , Smp are all real, then the Shapiro Conjecture (Theorem 17.61) asserts that 
all such p-planes H are real, and there are the expected number of them. In this way, the 
second part of Theorem 11.91 is a consequence of Theorem I7.6[ 

Second part of Theorem 11.91 // the polynomial $(t) G has simple real roots then 
there are H^mp i"^o,l points in Wt~^{F). 

Recall from Chapter [1] the formula for the degree of the Wronski map. 



which is the number of inverse images of a regular value of the Wronski map. The first part 
of Theorem II. 9[ which asserts that all points are real in a fiber of the Wronski map over 
a polynomial with only real roots, follows from the second by a limiting argument that we 
give in Section 18.2.31 

8.2 Asymptotic form of the Shapiro Conjecture is true 

It is not too hard to show that the conclusion of the Shapiro conjecture when all conditions 
a* are simple, a* = □, holds for some Si, . . . , Smp G R- We give a (by now) standard argument 
for the following asymptotic form of the Shapiro conjecture, which was given in |108j . and 
independently in [33] . 

Theorem 8.2 There exist real numbers Si, S2, . . . , Smp such that 



'mp 



l!2!---(m-l)! ■ [mp]\ 
p\{p+l)\ ■ ■ ■ {m+p—l)\ 



XaF.isi) n XaF,{s2) n • • ■ n XaF,{smp) 



is transverse with all points real. 
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The proof is a version of Schubert's principle of degeneration to special position and 
the same ideas can be used to establish similar results for other flag manifolds and related 
varieties, such as Theorem 17.51 on rational curves in Grassmannians. 

Interchanging a with a'^, the expansion (18.51) becomes (up to a sign) 

det[r(t) : E\ = Pa(^)-(-i)'"W=(r(t)). 

We convert this into a very useful form by expanding the minor pQ,c(r()f:)). Let a G ('™^^^)- 
Observe that the determinant 



det 



Ol — 1 



(ai - l)t"i 



Ql-2 



a 



is equal to 



• det 



1 (tti-l) 
1 (a2-l) 

1 (a™ - 11 



m 


l)t 


(ai- 


-1)! 


(ai- 


m)! 


("2- 


-1)! 




m)! 


(«m 


-1)! 


(am- 


-m)! 



(qi 


-1)! 


(oi- 


-m)\ 


("2 


-1)! 


(o2- 


-m)! 


(Om 


-1)! 


(Om- 


-m)! 



• det 



1 ai 

1 02 



On 



771— !■ 
1 

m— 1 



m—1 



(The second equality is via column operations.) We recognize this last determinant as the 
Van der Monde, ni<j('^j ~ ^i)- Write Va for the product 



;-i)'"'-nK-- 



Since ja'^l = mp — \a\, we obtain the expansion for the Wronskian (up to a global sign). 



det[r(t) : H] = Yl t"'''~^'"^V^cp^{H) 
«e('T') 



(8.6) 



Observe that if we write Ua '■= Vacpa{H), then (18. 6p becomes 



In particular, the coefficient of i/a depends only upon the rank, of a G {^"'+p^). Com- 
pare this to the Wronski polynomial (16. 2p of Section 16. ![ and compare the original polyno- 
mial dHSD to fO]) . 
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8.2.1 Schubert varieties 



We transpose all matrices, replacing column vectors by row vectors. Let H G Gr(p, m+p) 
be represented as the row space of a p by (m+p)-matrix, and apply Gaussian elimination 
with pivoting from bottom to top and right to left to H to obtain a unique representative 
matrix of the form 



H 



span 



* 



V* 



* 1 ■ 

* * • 



* * 





* 1 

; 

* : * 








1 







0/ 



(8.7) 



Here, the entries * indicate an unspecified element of our field (M or C). 

The set of columns containing the leading Is (pivots) is a discrete invariant of the linear 
subspace H. Let a: ai < a2 < ■ ■ ■ < ap he the positions of the pivots, that is, ai is the 
column of the leading 1 in row i. Observe that PjsiH) = unless /3j < ttj for every i. This 
coordinatewise comparison defines the Bruhat order on the indices a G ([™+p1). The set of 
linear spaces whose row reduced echelon forms (18. 7p have pivots in the columns of a forms a 
topological cell of dimension called the Schubert cell and written X°. The undetermined 
entries * in (18. 7p show that it is isomorphic to A'"', the affine space of dimension \a\. 

We ask: Which linear spaces are in the closure of the Schubert cell? For the answer, let 
Ma be the set of matrices with full rank p where the entries in row i are undetermined up 
to column a,-, and are thereafter. These matrices have the form 



* • 

V* ■ 



* 

* * 





* 








* 



■ o\ 

■ 

■ 

■ oy 



where the last undetermined entry * in row i occurs in column a^. This is a closed subset of 
the set of p by (m+p)-matrices of full rank p. The pivots /3 of a matrix M in will occur 
weakly to the left of the columns indexed by a, so that (3 < a, and all possibilities /3 can 
occur. 

In particular, this shows that the set of p-planes H parameterized by matrices in 
is the union of the Schubert cells indexed by /3 for /3 < a in the Bruhat order. This is 
a closed subset of the Grassmannian, in fact it is one of the Schubert varieties defined in 
Chapter [3 To see this, let ei, 62, . . . , e^+p be basis vectors corresponding to the columns of 
our matrices. For each i = 1, . . . , m+p let Fi be the linear span of the vectors ei, 62, . . . , e^. 
From the form of matrices in M^, we see that if H is the row space of a matrix in M„, then 
we have 

dim H n Fa- > j for j = 1, . . . , p . 

These dimension inequalities define the Schubert variety XaF,. Note that if if G XaF,, 
then pp{H) = unless (3 < a. Write for the Schubert cell consisting of those H of the 

form flHTj) . 

The key lemma in our proof of Theorem 18.21 is due essentially to Schubert 
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Lemma 8.3 For any a E (''"^'''), 

X^F.n{H\p^{H)=0} = \JXpF.. 

I3<a 

Here /3<a means that P < a, but there is no index /i in the Bruhat order with 13 < fi < a. 
This is easy to see set-theoretically, as for H e X^F, we have pp^H) = unless (3 < a. 

It is also easy to see that this is true on the generic point of each Schubert variety XjsF, 
for P < a. Fix some index /3 with P < a. Then there is a unique index k with Pk = c^k — 1, 
and for all other indices i, Pi = ai. Consider the subset of the matrices Mq,, where we require 
the entries in row i and column Pi to be 1, and write Xk,au fo^^ the entry in row k and column 

*10---0 o---ooo---o\ 

; ■•• : : : ; ; ; ; 

* * ■■■ * 1 Xk,a^ 0---0 0- --0 _ (8,8) 

; ; ; ; o o ••. ; ; ; 

* : : 1 0---0y 

The row spans of these matrices form a dense subset of the Schubert variety X^F,, and 
therefore define a coordinate patch for X^F,. If we set Xk,a,, = 0, then we get all matrices 
of the form (18.71) . but for the index p. 

If H is the row space of a matrix in this set ( 18. 8p . then Pa{H) = Xk^a^- Thus, on 
this coordinate patch for X^F,, the vanishing of the Pliicker coordinate Pa cuts out the 
Schubert variety XpF,, scheme-theoretically. Repeating this local argument for each /3 < «, 
proves (18. 3p . at least at the generic point of each component Xj^F, (which is sufficient for our 
purposes). More careful arguments show this is true even at the level of their homogeneous 
ideals. 



* . . . 

V* ■■■ 



8.2.2 Asymptotic form of Shapiro Conjecture 

We now have everything that we need to prove Theorem 18.21 We will prove a stronger 
statement using induction on the Bruhat order (sometimes called Schubert induction). 

Lemma 8.4 There exist real numbers Si, S2, ■ ■ ■ , Smp such that for all a G ('™^^'), 

H 

X^F, n flXoF.(si) (8.9) 

i=l 

is transverse with all points of intersection real. 

The statement of Theorem 18.21 is the case a = m+1, m+2, . . . , m+p, when XaF, is the 
Grassmannian. 
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Remark 8.5 It is not hard to see (it is equivalent to the Pliicker formula [88] for rational 
curves and was noted by Eisenbud and Harris [30, Theorem 2.3]) that the intersection f l8.9p 
lies in the Schubert cell X^F, for the index a. That is, every point H in the intersection (18.91) 
has row-reduced echelon form (18. 7p . for the index a. 

Observe that when a = 1,2, . . . ,p, then \a\ = and the Schubert variety Xq,F, consists 
of the single point {Fp}. Thus the base case of the induction to prove Lemma [8.41 is trivial, 
as there is no intersection to contend with. 

Suppose that we have real numbers Si, . . . ,Sj such that, for each a with \a\ = j the 
intersection (18. 9p is transverse with all points real. Let a G [^"^p^^) with |a| = Observe 
that by (18. 6p the intersection X^F, n X^F^it) is defined by the single polynomial equation 

J2t"'''-^^%^PpiH) = for HeX^F,. 

Since pp[H) = unless (3 < a, this becomes 

Dividing by the lowest power t"*?-!"! of t, this becomes 

J]tl"|-|^IV^/3cP;3(i/) = V^cp^H) + t-5^tl"|-|^|-V;3=p^(if) = 0. 
/3<a I3<a 

Since Va<= 7^ 0, we see that in the limit as t — )■ 0, this equation becomes Pa{H) = 0. Using 
the lemma of Schubert (18. 3p . we conclude that 



lim (X,F. n XaF.(t)) = X^F, H {H \ p^{H) = 0} = |J XpF, . (8.10) 

/3<a 



By our induction assumption on j, each intersection 

j 

XpF, n \^X^F,{s,) 



i=l 



is transverse with all points real, and by Remark 18.51 the intersection is contained in the 
Schubert cell XIF,. Since the Schubert cells are disjoint, we conclude that the intersection 



(\jXpF,^ n ^XuF,{s,) 



a<f} i=l 

is transverse with all points real. By the computation of the limit (18.101) . and the observation 
that transversality is preserved by small perturbations, we see that there is a number < 
such that if t < then 

j 

X^F, n XuF,{t) n {^XuF,{s,) 

1=1 
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is transverse with all points real. 

We complete the induction by setting Sj+i to be the minimum of the numbers Sa where 
|a| = J ' + 1. This proves Theorem 18.21 ^^X^ 

Similar asymptotic arguments are behind the proof of Theorem 17.51 which proved reality 
in the quantum Schubert calculus, as well as results for the classical flag manifolds and for 
the orthogonal Grassmannian |111] . 

Remark 8.6 The proof of Theorem 18.21 used induction to show that the intersection (18. 9p 
is transverse with all points real. In fact, it gives an inductive method to construct all the 
points of intersection. The induction began with a = 1,2, ... ,p so that |a| = and the 
Schubert variety X^F, consists of the single point {Fp}. When |a;| = j + 1, the limit 

j 3 

(limX,F.nXaF.(t)) n f|XaF.(si) = |J ^Z^^' ^ fl^^^'^^*) 

i=l (5<a i=l 

shows that each point in the intersection (18. 9p is connected to a point in 

j j 
[jXpF. n f]X^F.{s,) = [j{x^F. f|XoF.(s,)) (8.11) 

/3<a i=l I3<a i=l 

along a path as t ranges from s^+i to 0, and the union on the right is disjoint. 

For the inductive construction, we may suppose that the points in the set (18. lip have been 
previously constructed as /3 < a implies that \f3\ = j. Starting at one of the points in (18.1 ip 
and tracing the path from t = back to t = Sj+i gives a point in the intersection (18.90 for a 
with \a\ = and all such points in the intersection (18. 9p arise in this manner. Following 
paths along a general curve in C (as opposed to the line segment [0, Sj+i]) constructs points 
in the intersection (18. 9p where Sj+i is any complex number. This is the idea behind the 
numerical Fieri homotopy algorithm, which was proposed in [51] and implemented in |53j . 
Its power was demonstrated in [73], which used the Fieri homotopy algorithm to compute 
all solutions to a Schubert problem on Gr(3, 9) with 17589 solutions. 

If 6{a) is the number of points in the intersection (18. 9p . then this limiting process also 
gives the recursion along the Bruhat order 6 (a), 

S{l,2,...,p) = 1, 

Schubert discovered this recursion [98] and used it to compute the number 5(567) = 462 
when m = 4 and p = 3. This is the number 7^4 3 given by the formula, 

l!2!---(m-l)!-[mp]! ^ 



p\{p+l)\ ■ ■ ■ {m+p—l)\ 



which is also due to Schubert, as he solved his recursion to obtain a closed formula. This 
recursion shows that the number 6{a) may be interpreted combinatorially as the number of 
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paths from the bottom of the Bruhat order to the element a. In fact to each solution we 
constructed in (18. 9p . we may associate a path from 1, 2, ... ,p to a. 

Figure 18.11 shows the Bruhat order in this case when m = 4 and p = 3 and the recursion 
for the numbers 6{a). 




Figure 8.1: Schubert's recursion for Gr(3,7). 



8.2.3 Reduction to special case of Shapiro conjecture 

In Section I8.1.2[ we demonstrated that Theorem 11.91 is equivalent to Theorem 17.61 when all 
Schubert conditions are simple (each a* = □)• In fact this case of Theorem 17.61 implies a 
weak form of the general case, in which we do not require transversality. The main idea is 
to use the limit (I8.10p . which we must first reinterpret. The flag F, is the osculating flag 
F,(t) when t = 0. In fact, the limit (I8.10p still holds if we replace F, by -F.(s) and by s 
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for any point s of P^. That is, 



lim(X„F.(s)nXoF.(t)) = \JXf,F.{s). (8.13) 



This is simply the hmit ( I8.10p translated by the invertible matrix $(s) whose i,j-entry is 



1 /d^'-' 



as = F,{s + t). 



Theorem 8.7 Suppose that Theorem (\7.6\\ holds for the Schubert problem in which all con- 
ditions a* are simple (a* = □). Then for any a\ . . . , a" with mp = '^i{mp — |q:*|) and any 
distinct si, . . . , s„ G MF^, the intersection 



f|X,.F.(s,) (8.14) 



has all points real. 



We prove this by downward induction on the number n of Schubert conditions in fl8.14p . 
using the limit f l8.13p and the simple idea that a limit of a collection of real points is 
necessarily a collection of real points. 

First, when n = mp each a* is simple and all points in the intersection fl8.14p are real as 
that is our hypothesis in Theorem 18.71 Suppose that n < mp. Then we have < mp — 1 
for some i. Suppose that < mp — 1 and let /3 e {^^p^^) be a Schubert condition with 

< (3 so that = |a| + 1. Then 



n— 1 n—1 



\im^[f]X^.F,{s,)]n(^XpF,{sn)nX^F.{t)^ = [ f| X,.F.(s,)] n ( |J X„F.(s„)) . 

1=1 i=l a<f} 

The elementary inclusion C of the limit in the set on the right is clear from ( ]8.13p . The 
equality of the two sides follows as the intersection on the right is zero-dimensional, and 
therefore cannot contain any excess intersection. By induction, for general t G M, every 
point in the left-hand intersection is real, and so every point in the limit is real. Theorem 18. 71 
follows as a'^ < /3 and so the intersection fl8.14p is a subset of the right-hand side. 



8.3 Grassmann duality 

In Section 18.21 we showed how the Wronski formulation of the Shapiro Conjecture — m- 
dimensional spaces of polynomials of degree m+p—1 whose Wronskian has distinct real 
roots — corresponds to an intersection of hypersurface Schubert varieties in Gr(p, m+p) de- 
fined by flags that osculate the rational normal curve at the roots of the Wronskian. The first 
formulation concerns points in Gr(m, m+p) while the second concerns points in Gr(p, m+p). 
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In our proof of this correspondence we considered an m-dimensional space of polynomials 
as a space of linear forms on C'""'"'', and associated this to the p-plane annihilated by the 
linear forms. This gives a natural bijection 

Gr(m, (C™+P)*) — ^ Gr(p,C™+P). (8.15) 

Moreover, the annihilators of the subspaces in a flag F, in C™"'"^ form the dual flag F* in 
(C™^'^)*. Under the identification of Grassmannians flS.lSp . the Schubert variety XaF, of 
Gr(p, C'"+*') is identified with the Schubert variety Xa*F* in Gr(m, (C^+p)*), where 

a* : m+p+1 — a'^^ < m+p+1 — < ■ ■ ■ < m+p+1 — a\ < m+p+1 — a\ , 

that is, to obtain a*, first form the complement a'^ of a in [m+p], then subtract each 
component from m+p+1, and put the result in increasing order. This is an exercise in 
combinatorial linear algebra, and the relation |a| = |a*| is a combinatorial exercise. 

Write Cm+p-i[i] for the space of polynomials of degree at most m+j)— 1, which we iden- 
tified as the dual space to C"^"*"^. We describe the Schubert subvarieties of Gr(m, Cm+p-i[t]) 
that correspond to the Schubert varieties XaF,{s). 

Let V C Cm+p-i[i] be an m-dimensional space of polynomials. For any s G P^, V has a 
distinguished basis fi,-..,fm whose orders of vanishing at the point s are strictly increasing, 

ord,(/i) < ord,(/2) < ■■■ < ords{fm) ■ 

This follows by Gaussian elimination applied to any basis fi, . . . , fm oi V . Suppose that 
< Oi is the minimal order of vanishing at s of some /«. Reordering the basis, we may 
assume that ords(/i) = ai. Subtracting an appropriate multiple of /i from the subsequent 
elements gives a new basis, still written /i, . . . , /„, with ai < ords(/j) for i > 1. Suppose 
that ords(/2) is minimal among the orders of vanishing at s of fi for i > 1 and now subtract 
appropriate multiples of /2 from the subsequent elements, and continue. The resulting 
sequence a := (ai, . . . , am) = (ords(/i), . . . , ords(/m)) is the ramiGcation sequence of V at s. 

An elementary calculation shows that if V has ramification sequence a at a point s G P^, 
then the Wronskian of V vanishes at s to order 

\a\ = 01—0 + 02 — 1+ ■■■ +am— (m— 1). 

Define a flag E,{s) C Cm+p-i[t] where Ei{s) is the space of all polynomials that vanish 
to order at least m+p—i at s. With these definitions, we have the following lemma. 

Lemma 8.8 A space V of polynomials in Gr(m, Cm+p-i[i]) has ramification sequence a at 
s if and only if 

V G X^rE.is), 

where : m+p—am < ■ ■ ■ < m+p—ap G (''"^^') ■ 

A polynomial f{t) G Ei{s) if and only if {t — ^y^+P-* divides /, if and only if f^^\s) = 
for j = 0, 1, . . . , m+p—l—i. If we view / as a linear form on C™^^ so that f{t) = f o '~f{t), 
we see that f^-^\t) = f o 'y^^\t), and therefore f{t) G Ei(s) if and only if / annihilates the 
osculating subspace Fm+p^i{s) to 7 at 7(5). Thus -Ei(s)"'" = Fm+p-i{s), and so E,{s) is the 
dual flag to F,{s). In particular, the Schubert variety XaF,{s) corresponds to Xa*E,{s) 
under Grassmann duality. 
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Theorem 8.9 The identification of C^+p-ifi] as the dual space to C™'"'"^ induces an iso- 
morphism of Grassmannians 

Gr(m, (C"^+^)*) Gr(p,C"*+f). 

For any s G P^, this restricts to an isomorphism of the Schubert varieties, 
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Chapter 9: 

The Shapiro Conjecture for Rational 
Functions 

We continue our study of the Shapiro Conjecture, which asserts that if f2(t), . . . , fmif) 
are polynomials of degree m+p—1 whose Wronskian is has degree mp with distinct 
real roots, then the linear span of f2if), ■ ■ ■ , fmif) is real. We consider an apparently 
degenerate case, that of rational functions, which is when m = 2. 

Eremenko and Gabrielov [M] originally gave a proof of the Shapiro Conjecture when 
m = 2 using essentially the uniformization theorem from complex analysis. This proof 
was also complicated and challenging for this reader. They subsequently found a second, 
significantly more elementary proof [32] ■ We will first discuss that proof, and then a gener- 
alization concerning rational functions that are constant on prescribed sets, which leads to 
a generalization of the Shapiro Conjecture that we will discuss in Chapter [iTl 

9.1 The Shapiro Conjecture for rational functions 

The Shapiro conjecture for m = 2 asserts that if fi{t) and /2(t) are univariate polynomials 
whose Wronskian 

wr(A,/2) = mhit) - h{t)m (9.1) 

has only real roots, then the complex linear span (/i,/2) is real in that there are real 
polynomials gi and g2 with (/i,/2) = {91,92)- 

This has a natural interpretation in terms of rational functions. The quotient of two 
univariate polynomials /i and /2 defines a rational function p : — which on C C is 

p: t ^ m/f2{t) . 

Since 

^ m)f2{t) - h{t)m ^ wr(A,/2) 

if /i and /2 are relatively prime, the critical points of p in P^ are the roots of their Wronskian. 

Two rational functions pi, p2'- P^ — > P^ are equivalent if they differ by a fractional linear 
transformation on the target P^. Since a fractional linear transformation on /1//2 is a change 
of basis in the linear span (/i, /2), an equivalence class of rational functions is simply a two- 
dimensional space of polynomials. An equivalence class is real if the corresponding linear 
space is real. We state the theorem of Eremenko and Gabrielov. 

Theorem 9.1 (Shapiro's conjecture for rational functions |34L 132] ) If a rational func- 
tion p: P^ — >■ P-"^ has only real critical points, then p is equivalent to a real rational function. 
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Theorem [12] and the Grassmann duahty of Theorem 18.91 ensure the existence of a polyno- 
mial ^o{t) G IR2p[^] with only real roots such that every space of polynomials with Wronskian 
$o(^) is real, and there are exactly #2,p = ];^{^p) such spaces of polynomials. The elemen- 
tary proof of Theorem 19.11 analytically continues these #2,p real spaces of polynomials as 
the 2p distinct real roots of $o(^) vary. This continuation will produce fewer than ^2,p real 
spaces of polynomials for any $(t) only if some of the spaces become complex during the 
continuation. But this can happen only if two spaces of polynomials first become equal 
during the continuation. 

The proof shows that such a collision cannot occur by associating discrete objects called 
nets, to the real rational functions that are distinct from each of the #2,p spaces of polyno- 
mials with Wronskian $o(^)) and which are preserved under a continuation that varies the 
roots of $o(^)- Thus no collisions are possible, which will imply Theorem 19.11 

9.1.1 Continuity and nets of rational functions 

A point p in the Grassmannian Gr(2, Cp+i[t]) ~ Gr(2,p+2) is a two-dimensional space V of 
univariate polynomials of degree at most p+1. Each such point gives an equivalence class of 
rational functions p : — )■ of degree p+1— 6, where 6 is the maximum degree of a common 
factor of the polynomials in V. Working with this equivalence is awkward, so we will instead 
use the Stiefel manifold, Stp+i, which is a G'L(2, R)-fiber bundle over Gr(2, Cp+i[t]). 

The points of Stp+i are pairs of nonproportional univariate polynomials of degree at 
most p+1. Hence Stp+i is an open subset of M^^+^, with coordinates the coefficients of 
the polynomials / and g. We give Stp+i the subspace topology. The association Stp+i 3 
(/) 9) ' — ^ f /9 defines a map (written n) from Stp+i to the space of rational functions. While 
this map is not continuous as a map of spaces, it does have the weak continuity property 
given in Proposition 19.21 below. 

Let Z C Stp+i be the locus of pairs (/, g) with either 

deggcf(/, (?) > or deg(/) < d and deg(5') < p+1. 

That is, / and g either have a common root in C or else a common root at 00, and thus 
define a rational function f /g oi degree less than p+1. 

Proposition 9.2 Let {(pj \ j G N} C Stp+i \ Z be a sequence of points that converges to 
some point if = {f\g) G Z . Let zi, . . . ,Zk be the common roots of f and g (including 00 if 
deg(/) and deg{g) are both less than d). Then the sequence of functions {TT{ipj) \ j G N} 
converges to 7r{(f) uniformly on compact subsets of P^ \ {zi, . . . , Zk}. 

We give an elementary proof of this proposition. Let K G he a compact subset 
disjoint from the common roots {zi, . . . ,Zk} of / and g. We may cover P^ by the standard 
affine charts Co and Coo whose coordinates are t and 1/t, respectively. Then K = Kq U K^o, 
where Kq C Cq and Koo C Coo are compact subsets of the two affine charts. It suffices to 
show that the sequence of functions {iT{(pi)} converges uniformly to vr((/?) on each set Kq 
and i^oo- 

Now Kq is itself covered by compact sets Kq and Kq, where Kq contains no root of 
/ and Kq contains no root of g. Removing finitely many members of the sequence {(pi = 



100 



(/i) I ^ ^ N}, we may assume that no fi has a root in Kq and no gt has a root in K^. As 
{fi,gi) converges to {f,g) in Stp+i, and no gi has a root in Kq, both sequences of functions 

{Mt) I z G N} and {(^7.(t))-' M G N} 

are uniformly bounded in Kq. Therefore, the sequence of functions 

Mt) 



9^{t) 



is uniformly bounded and converges pointwise on the compact set Kq to f(t)/g(t). Thus 
this convergence is uniform on Kq, and it remains uniform under C Co C P^. The same 
arguments work for Kq, as well as K^o, which proves the proposition. ^^^iX;^ 

This proposition is half of the engine of this proof of Eremenko and Gabrielov. The other 
half is the asymptotic proof of Shapiro's conjecture, Theorem 18.21 

We now explain how to associate an embedded graph with distinguished vertices to each 
real rational function. Let Rp+i be the set of nonconstant real rational functions of degree 
at most p+1, all of whose critical points are real. If p G Rp+i, then p~^(RP^) C defines 
an embedded (multi-) graph F with the following properties: 

(i) r is stable under complex conjugation and MP^ C F. 
Call any edge in F \ MP^ an interior edge. 

(ii) The vertices of F lie on MP^ and are the critical points of the rational function p. The 
valence of a vertex is even and it equals twice the order of ramification of p at the 
critical point, which we call the local degree of F at the vertex. 

The set-theoretic difference P^\F is a union of 2d cells, where d is the degree of p. The closure 
of each cell is homeomorphic to a disc, and the boundary of each cell maps homeomorphically 
to MP^. This is because the cells (and their closures) are the inverse images of one of the 
two discs in P^ \ MP^ (or their closures), and there are no critical points in the interior of 
any cell. We deduce the following additional property of these multi-graphs. 

(iii) No interior edge of F can begin and end at the same vertex. 

Indeed, if an interior edge e begins and ends at the same vertex, then r(e) = MP^ as v is the 
only critical point on e. But then e must be the boundary of any cell adjacent to e, which 
implies that F consists of only two cells and one edge e and so p has degree 1, and in fact e 
was not an interior edge after all. 

Here are three pictures of such embedded (multi-) graphs for quintic rational functions 
with evenly spaced critical points. We have drawn MP^ as a circle with the upper half plane 
in its interior. The point a/— 1 is at the center of the circle, is the point at infinity, 

and complex conjugation is inversion in the circle. 
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We seek to analytically continue rational functions whose Wronskians lie in a curve of 
polynomials | z G [0,1]} where each ^zif) has degree 2p with distinct real roots 

s\{z\ s^iz)-, • • • , S2p{z\ and where each Si is a continuous function of z. The vertices of the 
graph p~^(MP^) associated to a rational function p with Wronskian are labeled by 

these roots, or equivalently by the numbers 1,2,..., 2p. Since the relative order of these 
roots S\{z)^ S2(^), . . . , s^piz) does not change as z varies (because each polynomial has 
distinct roots), we may capture this information by labeling only one root, say ^1(2;) (which 
is a vertex of the corresponding graph), and assuming that the roots are ordered in a manner 
consistent with a fixed orientation of MP^. It is these labeled graphs that we wish to consider 
up to isotopy (deformation in P^). 

Definition 9.3 A net is an (isotopy) equivalence class of such embedded multi-graphs in 
P^ satisfying (i), (ii), and (iii), with a distinguished vertex. 

Here are the five nets with 6 vertices, each with local degree 2 at every vertex. These 
correspond to rational functions of degree four with simple ramification. 




The uniform convergence of Proposition 19.21 implies a certain continuity of nets. Two 
subsets X, y C P^ have Hausdorff distance e if every point of X lies within a distance e of 
Y and vice- versa. This gives the Hausdorff metric on subsets of P^. 

Proposition 9.4 Let {^Pj} C Stp+i be a convergent sequence with limit (p. Then the sets 
{'K[ipj)^^{WF^)} converge in the Hausdorff metric to the set {7r((^)"^(]RP^)}. 

We deduce two corollaries from this proposition. 

Corollary 9.5 Suppose that {pz \ z ^ [0,1]} is a continuous path in i?2p where each pz has 
2p critical points. Let vi (z) be a continuous function of z which is equal to a critical point 
of Pz, for each z. Then the net of the pair 

{Pz\W), v^iz)) 

does not depend upon z. 

Corollary 9.6 Suppose that {ipz \ z & [0,1]} is a continuous path in the Stiefel manifold 
St2p. Suppose that for < z we have Ti{'~Pz) ^ R2p (^nd T^{<p>z) has distinct critical points 
vi{z),V2{z), . . . ,Vn{z), where Vi{z) is a continuous function of z, and also that, at z = 0, 
we have Vi{0) = ^2(0), but all other critical points are distinct. Then the degree of 7r{(fz) is 
constant for z G (0, 1], and deg(7r((y9o)) < deg(7r(<y5i)) if and only if the net of 7T{(pi) has an 
interior edge between fi(l) and V2{1). 
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Here are two nets for quartic rational functions ( 19. 2p as two of their vertices collide. 




In the first row, there is an edge (in fact two) of r\MP^ between the vertices V2 and vi. This 
edge collapses in the limit as V2 approaches f 1, which causes two regions of \r to collapse, 
so that the limiting net has two fewer regions in its complement, and thus corresponds to a 
rational function of degree 3. There is no such edge in the nets of the second row, and the 
limiting net still has eight regions and thus its rational function still has degree four. 

Proof of Corollary \9.(A The degree of a rational function p G is one-half the number of 
cells in the complement \ p~^(RP^) of the net of p. Set pz '■= 'n'ifz)- The only way for 
the number of cells in the complement of the net of pz to change at some zq ^ [0, 1] would 
be if some edge of p~^(]RP^) collapsed as z — )■ zq. Since the vertices of p~"'^(]RP^) are the 
critical points vi{z), . . . , Vn{z), which are distinct for z G (0, 1], we see that the degree of pz 
is constant for z G (0, 1]. 

If the degree of po is less than that of pi, then some edges which bound a cell must 
disappear in the limit as 2; — > 0. But this can only happen if a cell is bounded by edges 
between vi{z) and V2{z), as they are the only critical points which collide in the limit as 
2 — )■ 0. By (iii), such a cell must be bounded by more than one edge which implies that 
there was an interior edge between fi(l) and ^2(1) outside of MP^. This shows the necessity 
of such an interior edge between fi(l) and V2{1) for the degree to drop. 

For sufficiency, note that if there is an interior edge between fi(l) and ^2(1), then it 
must collapse in the limit as z — ?■ for otherwise condition (iii) for nets would be violated. 




9.1.2 Schubert induction for rational functions 

In Chapter [8], we used Schubert induction to construct a sequence of points Si, . . . , Smp G K 
and sufficiently many real points in each Schubert variety Xa-F,(0) which also lie in X^,{si) 
for i = Without re- running that proof, we will describe what that construction 

gives for rational functions. 

The construction of Theorem 18 . 21 relevant for rational functions was in the Grassmannian 
Gy{p,p+2). Under the Grassmann duality of Theorem 18. 9[ this becomes a construction 
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in Gr(2, Cp+i[t]) and involves a Schubert variety XaE,{t) with a G ('^2^')- In fact, the 
statements become identical after replacing F,{t) by E,(t). We will briefly recall these 
statements in this setting. 

A point in the Schubert cell X^E,{s) where a: ai < 0:2 is a two-dimensional subspace 
{f,g) of polynomials of degree p+1 such that 

ords(/) = p+2—a2 and ords(5') = p+2—ai. (9.3) 

In particular, 

and 

(Here, a'^Wb means that a'^ divides b, but a^~^^ does not divide b.) 
Given {f,g) G X°i?,(s), a consequence of (19. 3p is that 

Wr(/,<7) = f{t)g{t)-f{t)g'{t) 

vanishes to order p+l—a2+p+2—ai = 2p—\a\ at s. 

By Lemma there exist real numbers Si, . . . , S2p (in fact, we have Si > ■ ■ ■ > S2p > 0) 
such that for all a G ('^2^') , the intersection 

H 

X^E.iO) n []X^E.isi) (9.4) 

i=l 

is transverse, and it consists of 6{a) real points. Any point (/, g) in the intersection (19. 4p 
will have Wronskian 



f'it)g{t)-f{t)g'{t) = constant -t^P-H. 



(t - Si) 



i=l 



In fact, as noted in Remark 18. 6[ the proof of Lemma [8.41 did much more. Suppose that 
|a| > 0, and define 

:= «!— 1 < ^2 and := ai < a2—l , 

when possible. (/3^ is only defined if 1 < ai and is only defined if ai+l < 0:2) Then the 
proof constructed 6{a) = S{f3^) + S{f3^) families {{fz,gz) I ^ ^ [0, s\a\]} of polynomials such 
that 

1. For z^O, (/.,^7.) GX°. 

H-i 

2. f'Mgzit) - mg'M = constant ■ t^^H"! ■ ( J] (t - s,)) ■ (t - z). 

1=1 

3. Exactly 6{/3^) of these families began in X^ii?,(0). That is, for 5(/3*) of these families, 
we have (/cfi-o) e X^iE,(0). 
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9.1.3 Schubert induction for nets 



The main idea in the proof is that the rational functions constructed in Lemma 18.41 each 
have different nets. 



Theorem 9.7 The 6{a) rational functions in X°E,{0) constructed in Lemma \8^\ each have 
different nets. 

Suppose that (/, g) is a point in the intersection (19. 4p where / and g satisfy (19. 3p for 
s = 0. Then its Wronskian vanishes to order 2p— |q;| at and to order 1 at the points 
Si, . . . , s\a\. In particular, is the only common zero of / and g. Removing the common 
factor tP+2-"2 from both / and g gives relatively prime polynomials of degree at most 1. 
Indeed, if / and g had a common root s, then a linear combination of them would vanish 
to order at least 2 and so their Wronskian would vanish to order at least 2 at s. Then the 
rational function r := f /g has degree a2—l with Wronskian 



constant ■ t 



a2—ai—l 



The point (/, g) corresponds to a unique path in the Bruhat order from 12 to a in the 
Bruhat order. We claim that this path may be recovered from the net p~^(MP^) of p. 

Indeed, consider the ith step in the construction, when the critical point Si was created. 
By Corollary 19.61 the interior edge from Sj has other endpoint if the degree of the rational 
function increased at the ith step, and if its degree did not increase, then the other endpoint 
of that edge is at some critical point with Sk > Si and so k < i. Subsequent steps in the 
construction will not affect an edge from Sj to with k < i, but an edge between and Sj 
may be moved to an edge between Sj and Sj where j > i. 

Thus, the degree of the rational function increased at step i if and only if the other 
endpoint of an interior edge from Sj is at Sj with j > i. If /3 < /?' is the ith step in the chain 
corresponding to our rational function p, then 

1. /32 + 1 = /32; so the degree of the rational function increased, if the interior edge from 
Si has endpoint sj with j > i (so sj < Sj), and 

2. /?! + 1 = so the degree of the rational function did not increase, if the interior edge 
from Si has endpoint Sk with k < i (so Sk > Si). 

This completes the proof. ^^X;^ 

Figure WTl illustrates the formation of the nets during the Schubert induction for quartic 
rational functions, as well as the recursion for 6{a). 

We complete the proof of the Shapiro conjecture for rational curves. 

Theorem 9.8 Let <l>(t) be a real polynomial of degree 2p whose roots are all real. Then 
there are exactly 6{p+l,p+2) real equivalence classes of rational functions with Wronskian 
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Proof. Let Si > S2 > ■ ■ ■ > G M be the numbers such that the intersection 

2p 



transverse with all points real. Each point in the intersection is an equivalence class of 
rational functions with Wronskian 

i=l 

Let {$2 I z G [0, 1]} be a continuous family of polynomials of degree 2p all with distinct 
real roots and with = We attempt to analytically continue each point in the 

fiber Wt~^{^z) from 2; = to 2; = 1. The only way this continuation could fail would be if 
it encountered a fiber Wt~^{^z) with a multiple point. Then two of the rational functions 
in this fiber would have to coincide. In particular two would have the same net, (where we 
have labeled the nets by the root of the $z(t) corresponding to Si). This implies that two 
of the original rational functions in Wr ~^($o) have the same net, by Corollary 19.51 But this 
contradicts Theorem 19.71 ""^^^\r~ 



9.2 Rational functions with prescribed coincidences 

The results of Section 19.11 can be used to prove a result about real rational functions that 
satisfy a certain interpolation condition. This is is due to Eremenko, Gabrielov, Shapiro, 
and Vainstein ^35j, and may be interpreted in the Grassmannian Gr(p, p+2) as an appealing 
generalization of the Shapiro conjecture. We will discuss this generalization, the Secant 
Conjecture, in Chapter [TTl 

We first describe the interpolation problem. Let Ai, . . . ,An be disjoint finite subsets of 
where the set has 1 + ctj elements with 1 < Oj < p and ai + ■ ■ ■ + a„ = 2p. Write 
a for this sequence (oi, . . . , a„) of numbers. The interpolation problem is to determine the 
equivalence classes of rational functions p of degree p+1 that satisfy 

p\Ai is constant for i = 1, . . . ,n . 

There are in fact finitely many such equivalence classes of rational functions when the sets 
Ai are general. We will later prove this finiteness and show the number of equivalence classes 
is a Kostka number |38, p. 25], [761 1)6]. Heuristically, there are finitely many equivalence 
classes because the condition that a rational function is constant on a finite set of l+a 
elements gives a equations in the Stiefel coordinates for rational functions. 

A collection of sets Ai C MP^ for i = 1, . . . ,n is separated if there exist disjoint intervals 
Ji, . . . , 4 of MP^ with Ai C Ji for i = 1, . . . , n. 

Theorem 9.9 ([35]) Let a — (Oil , . . . , 

) with 1 < tti < p and ai + ■ ■ ■ + = 2p. For 
general separated subsets ofMF^, Ai, . . . , An with \Ai\ = l+a,, there are exactly real 
equivalence classes of rational functions p such that 

p\a, is constant for i = 1, . . . ,n . (9.5) 
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Given separated subsets Ai, . . . , An of MP^, the proof first constructs such a real rational 
function satisfying (19. 5 p for every net with a certain property, described below (19 .Op . Next, 
it relates this interpolation problem to a problem in the Schubert calculus that has 
solutions, and finally shows that Kg, is the number of nets with the property (19. 6p . This will 
imply that we have constructed all the solutions for general sets Ai. 

Theorem 19.91 generalizes Theorem 19.11 Suppose that we have subsets {A^ | ,2 G (0, 1]} of 
MP^ depending continuously on z, each of cardinality a + 1, whose limit as 2; — )■ consists 
of a single point, 

lim Az = {s} . 

2—^0 

Suppose further that we have a family {pz | 2 G [0, 1]} of rational functions that depends 
continuously on z, and such that for z > 0, pz is constant on A. Then po will have a critical 
point at s of order at least a. 

In this way. Theorem 19.91 implies Theorem 19.11 by simply considering the limit as the 
points in each set A^ collide. In fact, this analysis will enable us to deduce a stronger form 
of Theorem 19.11 

Theorem 9.10 Let ai, . . . , a„ be positive integers with 1 < ai < p and ai + ■ ■ ■ + a„ = 2p. 

Then every rational function of degree p + 1 with n real critical points of multiplicities 
ai,...,a„ is real. There are exactly K^, classes of such rational functions, and the corre- 
sponding Schubert varieties meet transver sally. 

9.2.1 Proof of Theorems MM and [930] 

Let TZp+i be the set of rational function of degree p+1 with exactly 2p critical points of 
multiplicity 1. We use two consequences of the work in Section [9A1 

1. If pi and p2 are rational functions in TZp+i with the same critical points and isotopic 
nets (where we use the same critical point for each net), then pi is equivalent to p2- 

2. For every net r C with a given vertex set V (and distinguished vertex vi G V), 
there is a unique equivalence class of rational functions in 7^p+i with critical set V and 
net (with distinguished vertex vi) isotopic to F. 

Actually, we only showed the second point for nets with local degree 2 at each critical 
point. The stronger statement follows from results in [M]. To prove Theorem 19. 9[ fix 
separated subsets Ai, . . . , of MP^ satisfying the hypotheses. Choose 2p additional points 
Si, . . . , S2p where, for each z, of the points interlace the Oj+l points of A^. Figure 19.21 
shows an example when p = 5 and a = (3, 2, 2, 2, 1). Each point Sj lies between two points 
of some set Ai. Write [xj,yj\ for the interval that contains Sj and note that Xj,yj G Ai, for 
some i. 

Consider nets with the vertices Si, . . . , S2p that have local degree 2 at each vertex and 
satisfy the additional hypothesis: 

There are no edges between points Sj and Sk that interlace the same set A^ (9.6) 
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A, 

Figure 9.2: Interlacing points. 

There are five nets satisfying (19. 6 p for the points Ai of Figure 19.21 (we only draw the edges 
in the upper half plane, which is the interior of the circle). 




(9.7) 



Suppose that we have critical points Si,...,S2p interlacing the sets Ai, where all the 
points Si are fixed and chosen arbitrarily within their intervals, except for one, Sj, which is 
allowed to vary within its interval [xj, yj\. Also fix an isotopy class F of nets satisfying (19. 6p . 
For each s G [xj^Uj] let ps G IZp+i be a rational function with the critical points si, . . . , S2p 
and net F. We may suppose that {ps \ s G [xj,yj]} is a continuous family. 

Lemma 9.11 There exists a point Sj G [xj,yj] such that ps^{xj) = ps^ivj)- 

To see this, we may assume that ps is normalized so that Ps{s) = and ps maps the 
interior edge of F terminating in sj to the interval [— oo, 0] in MP^. Then the difference 

is positive when s is near t/j and negative when s is near xj , so it changes sign on the interval 
[xj,yj] and therefore takes the value zero at some Sj G [xj,yj]. We give a picture below. 
The arrows point in the direction of increase of Ps{t) for t in the net. 




Ps{Xj) - PsiVj) > Ps{Xj) - psivj) < 
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Lemma 9.12 If T is a net satisfying ( 19. 6p . then there is a choice of critical points si, . . . , S2p 
interlacing the points of the sets Ai such that every rational function p of degree p+1 with 
the net T and critical points Sj satisfies 

p\a. is constant for i = 1, . . . ,n . (9-8) 

The set of possible critical points s = (si, . . . , S2p) G (MP^)^^ interlacing the sets Ai forms 
the interior of a closed cube 



Q ■= [Xl,yi] X [X2,y2] X ■ • • X [X2p,y2p] . 



By Lemma 19.111 for every j, the function (pj{s) := Ps{xj) — Ps{yj) (defined as described in 
the proof of Lemma [9.111) is positive on the face Sj = yj and negative on the face Sj = Xj. 
This implies (via variant of Brower's fixed point theorem) that there is a point s in the 
interior of Q where (Pj{s) = for all j, that is Ps{xj) = PsiVj) for all j. Since these intervals 
interlace the sets Ai, this implies (19. 8p . ~~>^^Nr 

The next step is to show that the number of nets satisfying (19.61) for sets Ai,...,An 
where A^ has l+Oj members and ai + ■ ■ ■ + a„ = 2p is the Kostka number K^- This Kostka 
number is the number of Young tableaux of shape 2 x p and content a |2Hl p. 25]. These 
are arrays consisting of two rows of integers, each of length p such that the integers increase 
weakly across each row and strictly down each column. For example, here are the five Young 
tableaux of shape 2 x p and content (3, 2, 2, 2, 1), showing that -^'(3,2, 2,2,1) = 5. 
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(9.9) 



We only describe the map from nets to Young tableaux. Given a net satisfying (19. 6p . 
we will successively place integers into a left-justified two-rowed array while traversing MP^. 
This starts from the first (in the canonical ordering on MP^) point in Ai and begins with an 
empty array. When a critical point Sj is encountered, there will be an interior edge of the 
net with endpoint Sj. Place the integer j in the second row if the other endpoint of that 
edge has already been encountered, and in the first row if it has not been encountered. For 
example, the tableaux in (19. 9 p correspond, in order, to the nets in fl9.7p . (There, the order 
on MP^ is counterclockwise on the circles.) 

This bijection shows that we have constructed Ka equivalence classes of rational functions 
satisfying (19. 5p . To complete the proof of Theorem 19. 9 [ we first show that the Kostka number 
Ka is the expected number of equivalence classes of rational functions satisfying ( 19. 5p . and 
then that there is some choice of the sets Ai for which there are exactly Ka equivalence 
classes of rational functions. This last step will also prove Theorem 19.101 

Recall that a polynomial / of degree p+1 may be considered to be a linear map on 
C^+^ so that the composition with the rational normal curve 7(t) : C — t- C^^"^ gives the 
polynomial f{t). We used this to relate ramification to osculating flags as in Section 1831 A 
two-dimensional space {f,g) of polynomials gives a map 



C 



lit) 



if,9) 
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The kernel H of the map C^+^ — — > corresponds to {f,g) under Grassmann duahty. 

Suppose that the rational function p = f/g is constant on some finite subset, A. This 
means that the line {f{a),g{a)) C is constant, for a & A. This implies that in C^+^ we 
have 

{H,j{a)) = {H,^{b)), 

for any a,b & A. In particular, H has exceptional position with respect to the |A|-plane 
S{A) := (7(a) I a G A) in that the two subspaces do not span C^+^. 

Thus, the equivalence classes of rational functions p of degree p + 1 that satisfy (19. 8p 
correspond to the p-planes H in C^"*"^ such that 

span{H,S{Ai)) ^ Cp+^ for i = 1, . . . , n . (9.10) 

Those H which satisfy (I9.10p are the points of an intersection of Schubert varieties. Let 
aa := (2, 3, ... , a+1, a+3, . . . ,p+2) e (^^+^1). Then X^^F, consists of those H e Gi{p,p+2) 
such that 

span(/7,F,+i) ^ C^^^ 

We will also write Xcr^Fa+i for this Schubert variety, which has dimension \aa\ = 2p—a. Thus 
the solutions to the interpolation problem (19. 8 p correspond to the intersection of Schubert 
varieties 

x^^^S{A,) n x,^^s{A,) n ■■■ n x,^^s{A^), 

which is expected to be zero dimensional. These are special Schubert varieties, so the 
expected number of points in this intersection may be computed using the Fieri formula, 
and it is the Kostka number Kg, p. 25]. 

All that remains to show is that there is some choice of the sets Ai for which there 
are finitely many equivalence classes of rational functions satisfying the interpolation con- 
dition (19.81) . We show that indirectly, by passing to the limit as each set Ai collapses to a 
single point, Sj. If we consider the rational functions for a given net in this limit, we see 
that the limiting rational function still has degree p+1, by Corollary 19.61 as no interior edges 
were collapsed in the limit, by Condition (19. 6p . The limiting rational function has a critical 
point at each Sj of multiplicity a^, and is necessarily real. 

It is easy to see that there are still nets that have critical points of multiplicity at 
points Si — the same bijection works. For example, here are the nets of rational functions of 
degree five with critical points having multiplicities (3, 2, 2, 2, 1). 



Moreover, the corresponding intersection of Schubert varieties is expected to have points. 
Thus we have constructed the expected number of real rational functions with the desired 
critical points. The proof of Theorem 19.101 now follows from the results of [34] , which gave 
a bijective correspondence between nets and rational functions. 
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Chapter 10: 

Proof of the Shapiro conjecture 

The Shapiro Conjecture was proven by Mukhin, Tarasov, and Varchenko in a preprint 
of November 2005 [81j. Like the proofs of the special case of m = 2 by Eremenko and 
Gabrielov [SU [32j (discussed in Chapter [9]) the proof in the general case did not use much 
algebraic geometry. Instead it used results in mathematical physics, specifically integrable 
systems, with some representation theory. Unlike the rest of the material in this book, this 
chapter is not elementary. It does contain a sketch of some of the main ideas in their proof, 
but it by no means complete, and we recommend that the serious reader go to the original 
sources. An expanded account of the Shapiro conjecture and its proof appeared in the Bul- 
letin of the AMS |115] . In fact, what follows is a mildly revised version of Sections 2, 3, and 
4 of [TT5] . 

10.1 Spaces of polynomials with given Wronskian 

By Theorem 18. 7[ the general case of the Shapiro conjecture follows from the special case 
when all the Schubert conditions are equal to □, and this case is equivalent to the Wron- 
ski formulation of Theorem 11.91 A further reduction is possible, as the Wronski map 
Wr : Gr(m, m+p) — t- P™'^ is a finite map in that it has finite fibers, a standard limiting 
argument (given, for example, in Section 1.3 of [81] or Remark 3.4 of |lllj ) shows that it 
suffices to prove Theorem 11.91 when the Wronskian has distinct real roots that are sufficiently 
general. Since #m,p is the upper bound for the number of spaces of polynomials with a given 
Wronskian, it suffices to construct this number of distinct spaces of real polynomials with a 
given Wronskian, when the Wronskian has distinct real roots that are sufficiently general. 
In fact, this is exactly what Mukhin, Tarasov, and Varchenko do [8T] . 

Theorem II. 9f . If Si, . . . , Smp are generic real numbers, there are H^m,p real spaces of poly- 
nomials in Gr(m, Cm+p-i[t]) with Wronskian Y\^^i{t — Si). 

The proof first constructs 7^m,p distinct spaces of polynomials with a given Wronskian 
having generic complex roots, which we describe in Section 110. 1.11 This uses a Fuchsian 
differential equation given by the critical points of a remarkable symmetric function, called 
the master function. The next step uses the Bethe ansatz in a certain representation V of 
slmC: each critical point of the master function gives a Bethe eigenvector of the Gaudin 
Hamiltonians which turns out to be a highest weight vector for an irreducible submodule of 
V . This is described in Section ri0.2[ where the eigenvalues of the Gaudin Hamiltonians on a 
Bethe vector are shown to be the coefficients of the Fuchsian differential equation giving the 
corresponding spaces of polynomials. This is the germ of the new, deep connection between 
representation theory and Schubert calculus that led to the full statement of Theorem 17.61 
(reality and transversality) . Finally, the Gaudin Hamiltonians are real symmetric operators 
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Figure 10.1: Schematic of proof of Shapiro conjecture. 



when the Wronskian has only real roots, so their eigenvalues are real, and thus the Fuchsian 
differential equation has real coefficients and the corresponding space of polynomials is also 
real. Figure [TO . 1 1 presents a schematic of this extraordinary proof. 

10.1.1 Critical points of master functions 

The construction of #m,p spaces of polynomials with a given Wronskian begins with the 
critical points of a symmetric rational function that arose in the study of hypergeometric 
solutions to the Knizhnik-Zamolodchikov equations [52] and the Bethe ansatz method for 
the Gaudin model. 

The master function depends upon parameters s := (si, . . . , Smp), which are the roots of 
our Wronskian $, and an additional variables 

Ji. .— [^d^i , . . . , , d^i , . . . , Jy2p ! • • • ; -^1 ) • • • ) ■^{m-l)pJ • 

Each set of variables x^*^ := (x^*\ . . . , x^p) will turn out to be the roots of certain intermediate 
Wronskians. 

Define the master function S(x; s) by the (rather formidable) formula 

n 

n n - 4^^)' ■ n - 

^=1 ^<j<k<ip l<j<k<mp , 

n-1 ip (m—l)p mp 

n n n (4" - -t") ■ n im-"' - 

i=l j=l k=l j=l k=l 

This is separately symmetric in each set of variables x''*^ The Cartan matrix for slm appears 
in the exponents of the factors {x^^ —xi^'^) in (110. ip . This hints at the relation of these master 
functions to Lie theory, which we do not discuss. 
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The critical points of the master function are solutions to the system of equations 



1 d 

"Trm'='(^'^) = ^ « = l,...,m-l, j = l,...,ip. (10.2) 

^ dxj ' 

When the parameters s are generic, these Bethe ansatz equations turn out to have finitely 
many solutions. The master function is invariant under the group 

S := Sp X X ■ ■ ■ X iS(m_i)p , 

where is the group of permutations of {1,...,A^}, and the factor Sip permutes the 
variables in x*^*\ Thus S acts on the critical points. The invariants of this action are 
polynomials whose roots are the coordinates of the critical points. 

Given a critical point x, define monic polynomials gx := {gi, ■ ■ ■ ,gm-i) where the com- 
ponents x*^*^ of X are the roots of gi, 

ip 

g- ■= Y[(t - xf) for i = l,..., m-1 . (10.3) 

Also write gm for the Wronskian, the monic polynomial with roots s. The master function 
is greatly simplified by this notation. The discriminant Discr(/) of a polynomial / is the 
square of the product of differences of its roots and the resultant Res(/, h) is the product of 
all differences of the roots of / and h [22]. Then the formula for the master function (110. ip 
becomes 

m j 1 

S(x;s) 

1=1 / i=l 

The connection between the critical points of S(x; s) and spaces of polynomials with 
Wronskian $ is through a Fuchsian differential equation. Given (an orbit of) a critical point 
X represented by the list of polynomials gx, define the fundamental differential operator Dx 
of the critical point x by 

^-lnfr^))-f4-lnf^))f^-ln(..)), (10.5) 



Y[BiscT{g,) / II Res{g„g,+,) . (10.4) 



^dt \g.rn-i^ ^ ^dt \giJJ\dt 
where ln'(/) := ^ In /. The kernel V"x of -Dx is the fundamental space of tlie critical point x. 
Example 10.1 Since 

we see that gi is a solution of D^. It is instructive to look at and when m = 2. 
Suppose that g a solution to Dx that is linearly independent from gi. Then 



°-(l-0)(l-'"'<«'))-(l-->'(^))(«'-^») 
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This implies that 



so $ = Wr {g,gi), and the kernel of is a two-dimensional space of functions with Wron- 
skian $. 

What we just saw is always the case. The following result is due to Scherbak and 
Varchenko [96] for m = 2 and to Mukhin and Varchenko [SSJ §5] for all m. 

Theorem 10.2 Suppose that is the fundamental space of a critical point x of the master 
function S with generic parameters s which are the roots of ^. 

1. Then ^ Gr(m, Cm+p-i[t]) has Wronskian $. 

2. The critical point x is recovered from in some cases as follows. Suppose that 
fi,..-,fm di^G monic polynomials in with deg/j = p—l+i, each fi is square-free, 
and that the pairs fi and /j+i are relatively prime. Then, up to scalar multiples, the 
polynomials gi, . . . , gm-i in the sequence gx are 

fi, Wr(/i,/2), Wr(/i,/2,/3), ••• , Wr(/i,...,/J. 



Statement (2) includes a general result about factoring a linear differential operator into 
differential operators of degree 1. Linearly independent C°° functions fi,...,fm span the 
kernel of the differential operator of degree m, 



det 



//i 



V/: 



/2 



r{m) 
J2 



fm 1 \ 



£1 _d_ 

■I'm. dt 



(m) 
m 
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If we set gi := Wr (/i, . . . , fi), then fllO.Sp is a factorization over C(t) of this determinant 
into differential operators of degree 1. This follows from some interesting identities among 
Wronskians shown in the Appendix of [83j. 

Theorem 110.21 is deeper than this curious fact. When the polynomials gi,...,gm and 
$ are square-free, consecutive pairs are relatively prime, and s is generic, it implies that 
the kernel V of an operator of the form (110. 5p is a space of polynomials with Wronskian $ 
having roots s if and only if the polynomials gi, . . . ,gjn come from the critical points of the 
master function (110. ip corresponding to $. 

This gives an injection from iS-orbits of critical points of the master function S with 
parameters s to spaces of polynomials whose Wronskian has roots s. Mukhin and Varchenko 
showed that this is a bijection when s is generic. 

Theorem 10.3 (Theorem 6.1 in [84]) For generic complex numbers s, the master func- 
tion S has nondegenerate critical points that form ^rn,p distinct orbits. 
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The structure (but not of course the details) of their proof is remarkably similar to the 
structure of the proof of Theorem 18.21 they allow the parameters to collide one-by-one, and 
study how the orbits of critical points behave. Ultimately, they obtain the same recursion 
as in fl8.12p . which mimics the Fieri formula for the branching rule for tensor products of 
representations of sl^ with its last fundamental representation V^^_-^. This same structure is 
also found in the main argument in [33]. In fact, this is the same recursion in a that Schubert 
established for intersection numbers S{a), and then solved to obtain the formula (11. 4p in \98\ . 

10.2 The Bethe ansatz for the Gaudin model 

The Bethe ansatz is a general (conjectural) method to find pure states, called Bethe vec- 
tors, of quantum integrable systems. The (periodic) Gaudin model is an integrable system 
consisting of a family of commuting operators called the Gaudin Hamiltonians that act on 
a representation V of simC In this Bethe ansatz, a vector-valued rational function is con- 
structed so that for certain values of the parameters it yields a complete set of Bethe vectors. 
As the Gaudin Hamiltonians commute with the action of stm.C, the Bethe vectors turn out 
to be highest weight vectors generating irreducible submodules of V, and so this also gives a 
method for decomposing some representations V of slmC into irreducible submodules. The 
development, justification, and refinements of this Bethe ansatz are the subject of a large 
body of work, a small part of which we mention. 

10.2.1 Representations of sl^C 

The Lie algebra sI^C (or simply slm) is the space of m x m-matrices with trace zero. It has 
a decomposition 

slm = n_ © [) © n+ , 

where n+ (n_) are the strictly upper (lower) triangular matrices, and [) consists of the 
diagonal matrices with zero trace. The universal enveloping algebra Uslm of sim is the 
associative algebra generated by slm subject to the relations uv — vu = [m, v] for u,v E st^, 
where [u,v\ is the Lie bracket in 5[m- 

We consider only finite-dimensional representations of sim (equivalently, of Usim)- For 
a more complete treatment, see |12]. Any representation V of sim decomposes into joint 
eigenspaces of [), called weight spaces, 

V = ^V[f,], 

At6h* 

where, for v G V[fi] and /i G f), we have h.v = fi{h)v. The possible weights /i of representa- 
tions lie in the integral weight lattice. This has a distinguished basis of fundamental weights 
ui, . . . , Um-i that generate the cone of dominant weights. 

An irreducible representation V has a unique one-dimensional weight space that is an- 
nihilated by the nilpotent subalgebra n+ of slm- The associated weight fi is dominant, and 
it is called the highest weight of V. Any nonzero vector with this weight is a highest weight 
vector of V, and it generates V. Furthermore, any two irreducible modules with the same 
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highest weight are isomorphic. Write for the highest weight module with highest weight 
fi. Lastly, there is one highest weight module for each dominant weight. 

More generally, if V is any representation of slm and /i is a weight, then the singular 
vectors in V of weight /i, written sing(y[yu]), are the vectors in V[fi] annihilated by n+. 
If w G sing(\/[/i]) is nonzero, then the submodule Uslm-v it generates is isomorphic to the 
highest weight module V^. Thus V decomposes as a direct sum of submodules generated by 
the singular vectors, 

V = 0t/5U.sing(r[/i]), (10.6) 
/i 

so that the multiplicity of the highest weight module in is simply the dimension of its 
space of singular vectors of weight /i. 

When \^ is a tensor product of highest weight modules, the Littlewood-Richardson 
rule [3S] gives formulas for the dimensions of the spaces of singular vectors. Since this 
is the same rule for the number of points in an intersection (17. ip of Schubert varieties from 
a Schubert problem, these geometric intersection numbers are equal to the dimensions of 
spaces of singular vectors. In particular, if V^-^ ^ is the defining representation of slm 
and = A'""^ ^1 — (these are the first and last fundamental representations of 

slyn), then 

dimsing(V;«-P[0]) = (10.7) 

It is important to note that this equality of numbers is purely formal, in that the same 
formula governs both numbers. A direct connection remains to be found. 

10.2.2 The (periodic) Gaudin model 

The Bethe ansatz is a conjectural method to obtain a complete set of eigenvectors for the 
integrable system on V := given by the Gaudin Hamiltonians (defined below). Since 

these Gaudin Hamiltonians commute with slm, the Bethe ansatz has the additional benefit 
of giving an explicit basis for sing(\^[/i]), thus explicitly giving the decomposition (110. 6p . 

The Gaudin Hamiltonians act on V"^"_^ and depend upon n distinct complex numbers 
Si, . . . ,Sn and a complex variable t. Let glm be the Lie algebra of m x m complex matrices. 
For each i,j = 1, . . . ,m, let Eij G Qlm be the matrix whose only nonzero entry is a 1 in 
row i and column j. For each pair consider the differential operator Xij{t) acting on 
Yf"' -valued functions of t, 

^ n ^(fc) 

fc=l 

where E'j^'* acts on tensors in V^_^ by Ej^i in the kth factor and by the identity in other 
factors. Define a differential operator acting on V^_^-valued functions of t, 

M := ^Sgn((T) Xi,^(i)(t) X2,a{2){t) ■ ■ ■ Xm,a{m){t) , 

o-e5 
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where S is the group of permutations of {1, . . . , m} and sgn(cr) = ± is the sign of a permu- 
tation a E S. Write M in standard form 



These coefficients Mi{t), . . . , Mm{t) are called the (higher) Gaudin Hamiltonians. They are 
linear operators that depend rationally on t and act on V"^_^. We collect together some of 
their properties. 

Theorem 10.4 Suppose that si, . . . , s„ are distinct complex numbers. Then 

1. The Gaudin Hamiltonians commute, that is, [Mj(-u), Mj{v)] = for all i,j = 1, . . . ,m 
and M, G C. 

2. The Gaudin Hamiltonians commute with the action of slm on V^_^. 



Proofs are given in [70], as well as Propositions 7.2 and 8.3 in [80], and are based on 
results of Talalaev [125] . A consequence of the second assertion is that the Gaudin Hamilto- 
nians preserve the weight space decomposition of the singular vectors of V^_^. Since they 
commute, the singular vectors of Vf"' have a basis of common eigenvectors of the Gaudin 
Hamiltonians. The Bethe ansatz is a method to write down joint eigenvectors and their 
eigenvalues. 



10.2.3 The Bethe ansatz for the Gaudin model 

This begins with a rational function that takes values in a weight space V^_J/i], 

t; : C' X C" ^ " [fA ■ 

This universal weight function was introduced in [95] to solve the Knizhnik-Zamolodchikov 
equations with values in V^_J/i]. When (x, s) is a critical point of a master function, the 
vector f (x, s) is both singular and an eigenvector of the Gaudin Hamiltonians. (This master 
function is a generalization of the one defined by fllO.ip .) The Bethe ansatz conjecture for 
the periodic Gaudin model asserts that the vectors v{'x, s) form a basis for the space of 
singular vectors. 

Fix a highest weight vector Vm € K;,„_i [wm-i]- Then t>^" generates as a Uslm'^'^- 

module. In particular, any vector in V^_^ is a linear combination of vectors that are obtained 

from f ®" by applying a sequence of operators -Sf+i j, for 1 < k < n and 1 < i < m—1. The 
universal weight function is a linear combination of such vectors of weight fi. 
When n = mp, I = p{^), and /i = 0, the universal weight function is a map 

V : 2 ) X — > V^'^P fOl . 

'-^m — 1 L J 

To describe it, note that a vector Ea+i,aEb+iM- ■ ■ Ec+i^c-Vn+i is nonzero only if 

(a, 6, . . . , c) = {a,a+l, . . . ,m—2,m—l) . 
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Write Va for this vector. The vectors ui, . . . , form a basis of V^^_^. Thus only some 
sequences of operators e\^^ ^ apphed to w®"*^ give a nonzero vector. These sequences are 

completely determined once we know the weight of the result. The operator -Sf^i j lowers 
the weight of a weight vector by the root Since 

mujm-i = ai + 2^2 H + ("^-l)«m-i, (10.8) 

there are ip occurrences of -Sf+i j, which is the number of variables in x*^'). 

Let B be the set of all sequences (61, 62, ... , bmp), where 1 < 6^ < m for each k, and we 
have 

#{k \bk<i} = ip. 

Given a sequence B in B, define 

Vb ■= Vb^ ® ^ ■ ■ ■ ® Vb^^ 
mp 

— ^ P^'^) ^ 

— \^ \'^b,,+l,bk ' ' ' -^m-l,m-2 ' ^m,m-l )-^m , 

k=l 

where the operator iJ^^^-^ ■ ■ ■-E^li „j_2 ■ is the identity if bk = m. Then vb is a 

vector of weight 0, by fllO.Sp . The universal weight function is a linear combination of these 
vectors vb, 

t;(x;s) = ^wb(x;s) -tJB, (10.9) 

BeB 

where the function wb{'^, s) is separately symmetric in each set of variables x*^*^ 
To describe ws(x; s), suppose that 

z = (zW,z(2),...,z(™^')) 

is a partition of the variables x into mp sets of variables where the kth. set z^'^^ of variables 
has exactly one variable from each set x*^*) with bk < i (and is empty when bk = m). That 
is, if bk < m—1, then 

= (x^),x^+i),...,a;("^i)), (10.10) 
for some indices C;,^, . . . , Cm-i- If bk = m, set Wk{z) := 1, and otherwise 

1 1 1 

Wkiz;s) :-- 



Abk) _ Jb^+l) Am-2) _ (m-1) (m-1) _ 

•^Ctj, '^bf.+l •'^Cjn-2 -^Cm-l -^Cm-l 



in the notation (110.101) . Then we set 



mp 

w{z;s) := JJu;fc(z;s) 

k=l 



Finally, w_b(x; s) is the sum of the rational functions w{z;s) over all such partitions z of the 
variables x. ( Equivalent ly, the symmetrization of any single w{z;s).) 
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While f(x, s) ( 110. 9p is a rational function of x and hence not globally defined, if the 
coordinates of s are distinct and x is a critical point of the master function (110. ip . then 
the vector f(x, s) G K^^[0] is well-defined, nonzero and it is in fact a singular vector 
(Lemma 2.1 of [84j). Such a vector t'(x, s) when x is a critical point of the master function 
is called a Bethe vector. Mukhin and Varchenko also prove the following, which is the second 
part of Theorem 6.1 in [8^ . 



Theorem 10.5 When s G is general, the Bethe vectors form a basis of the space 

sing(C-i[0]). 

These Bethe vectors are the joint eigenvectors of the Gaudin Hamiltonians. 

Theorem 10.6 (Theorem 9.2 in |80] ) For any critical point x of the master function 
S ( 110. ip . the Bethe vector v(jsi, s) is a joint eigenvector of the Gaudin Hamiltonians Mi(t), . . . , M„ 
Its eigenvalues fii{t), . . . are given by the formula 

d^ d"^~^ d 

where gi{t), . . . , gm-iif) o.^^^ the polynomials fll0.3p associated to the critical point x and 
is the polynomial with roots s. 

Observe that (110. lip is similar to the formula (110. 5p for the differential operator of the 
critical point x. This similarity is made more precise if we replace the Gaudin Hamiltonians 
by a different set of operators. Consider the differential operator formally conjugate to 
(-1)"^M, 

jm jm-1 J 

= d^- dF^^^'^'^ + ■■■ + (-ir"'^^™-w + i-^rMmit) 



dt^ '''dt^-^ ''"'^'dt 

These coefficients Ki{t) are operators on V^^^ that depend rationally on t, and are also 
called the Gaudin Hamiltonians. Here are the first three, 

K^{t) = -M,{t), K,{t) = M^it) - nM[it), 

K,{t) = -M,{t) + {n-l)M'^{t) - (^^Ml'{t), 

and in general Ki{t) is a differential polynomial in Mi(t), . . . , Mi{t). 

Like the Mi{t), these operators commute with each other and with sim, and the Bethe 
vector u(x, s) is a joint eigenvector of these new Gaudin Hamiltonians Ki{t). The corre- 
sponding eigenvalues Ai(t), . . . , Xm(t) are given by the formula 

d"^ d 

>^l(t)——r + ■■■ + Xrn-lit)- + Xrait) = 



dt"" ""^''dt 

:|-'"'(£;))(l--'(S))-(l-'<S)(l-'"''->'' 
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which is (!) the fundamental differential operator Dy^ of the critical point x. 

Corollary 10.7 Suppose that s G C'"*' is generic. 

1. The Bethe vectors form an eigenbasis of sing(V^™^[0]) for the Gaudin Hamiltonians 

K^{t),...,Kmit). 

2. The Gaudin Hamiltonians Ki{t), . . . , Km{t) have simple spectrum in that their eigen- 
values separate the basis of eigenvectors. 

Statement (1) follows from Theorems 110. 51 and 110. 61 For Statement (2), suppose that two 
Bethe vectors f(x, s) and f(x',s) have the same eigenvalues. By (110. 12p . the corresponding 
fundamental differential operators would be equal, = -Dx'. But this implies that the 
fundamental spaces coincide, = Vx'- By Theorem 1 1 . 2 1 the fundamental space determines 
the orbit of critical points, so the critical points x and x' lie in the same orbit, which implies 
that f (x, s) = f (x', s). 

10.3 Shapovalov form and the proof of the Shapiro 
conjecture 

The last step in the proof of Theorem 11.91 is to show that if s G M™^ is generic and x is a 
critical point of the master function (110. ip . then the fundamental space Vx_ of the critical 
point X has a basis of real polynomials. The reason for this reality is that the eigenvectors 
and eigenvalues of a symmetric matrix are real. 

We begin with the Shapovalov form. The map r: Eij i— )■ Ej^i induces an antiautomor- 
phism on slm- Given a highest weight module V^, and a highest weight vector t> G V^[/i] the 
Shapovalov form (■, ■) on is defined recursively by 

{v,v) = 1 and {g.u,v) = {u,T{g).v) , 

for g G slm and u,v E V. In general, this Shapovalov form is nondegenerate on and 
positive definite on the real part of V^. 

For example, the Shapovalov form on K;„_^ is the standard Euclidean inner product, 
(f j, Vj) = 6ij, in the basis vi, . . . ,Vm of Section 110.2.31 This induces the symmetric (tensor) 
Shapovalov form on the tensor product V^*"*^, which is positive definite on the real part of 

Theorem 10.8 (Proposition 9.1 in [80] ) The Gaudin Hamiltonians are symmetric with 
respect to the tensor Shapovalov form, 

{Ki{t).u,v) = {u,Ki{t).v), 

for alii = l,...,m,te C, and u,v e Yf"'^ . 

J 117 7 1 777,-1 

We give the most important consequence of this result for our story. 
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Corollary 10.9 When the parameters s and variable t are real, the Gaudin Hamiltonians 
Ki{t), . . . , Km{t) are real linear operators with real spectrum. 

Proof. The Gaudin Hamiltonians Mi(t), . . . , Mm{t) are real linear operators which act on the 
real part of y®"^p^ by their definition. The same is then also true of the Gaudin Hamiltonians 
Ki{t),. . . ,Km{t). But these are symmetric with respect to the Shapovalov form and thus 
have real spectrum. 

Proof of Theorem \1.9[ Suppose that s G M™^ is general. By Corollary 110.91 the Gaudin 
Hamiltonians for t G M acting on sing(V^^_ jO]) are symmetric operators on a Euclidean 
space, and so have real eigenvalues. The Bethe vectors u(x, s) for critical points x of the 
master function with parameters s form an eigenbasis for the Gaudin Hamiltonians. As s 
is general, the eigenvalues are distinct by Corollary 110.71 (2), and so the Bethe vectors must 
be real. 

Given a critical point x, the eigenvalues Ai(t), . . . , \m(t) of the Bethe vectors are then 
real rational functions, and so the fundamental differential operator has real coefficients. 
But then the fundamental space of polynomials is real. 

Thus each of the #m,p spaces of polynomials \^ whose Wronskian has roots s that were 
constructed in Section flO.ll is in fact real. This proves Theorem 11.91 
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Chapter 11: 

Beyond the Shapiro Conjecture 

Here, we will touch on further topics related to the Shapiro Conjecture, including 
I Transversality and Discriminants. 
II Maximally Inflected Curves. 

III The Shapiro Conjecture for flag manifolds (Monotone Conjecture). 

IV The Secant Conjecture and the Monotone Secant Conjecture 

V The Shapiro Conjecture for Lagrangian and Orthogonal Grassmannians. 

This chapter currently only in a rough draft, and will require a complete rewrite. 

11.1 Transversality 

In Chapter [9l we presented a proof of the Shapiro Conjecture for rational functions by 
Eremenko and Gabrielov [32]. Its main point was that there is no obstruction to analyti- 
cally continuing the rational functions that were constructed in Chapter [8] to give rational 
functions with any given Wronskian having distinct real zeroes. The key to this was the 
association of a net to each rational function with only real critical points, which showed 
that the analytic continuation was possible. 

A consequence of this proof is the statement that when min(m,p) = 2, the Wronski map 

Wr : Gr(p,C™+P) CP™^' 

is unramified over the locus of hyperbolic polynomials with distinct (real) roots. This is in 
fact true for all Grassmannians, as Mukhin, Tarasov, and Varchenko showed [82] . 

Theorem 11.1 The Wronski map is unramified over the locus of hyperbolic polynomials 
with distinct roots, for any m and p. 

More generally, given called Schubert data, ai, 0^2, . . . , an we can consider intersections 
of the form 

x^,F,{si) n x^,F,{s2) n ■■■ n x,„F.(s„) , (ii.i) 

where Si, S2, . . . , s„ G P^. Recall that a collection of subvarieties meet transversally if at all 
points of their intersection, their tangent spaces meet properly (have the expected dimension 
of intersection). We rephrase Theorem 111.11 

Transversality Theorem. // the points Si, S2, ■ ■ ■ , Sn are real and distinct, then the inter- 
section flll.ip is transverse. 

This Transversality Theorem has a strengthening, which is stated in terms of real algebra. 
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Definition 11.2 The discriminant is the locus in P™^ of critical values of the Wronski map 
(points over which it is ramified). It is an algebraic hypersurface and defined by a single 
polynomial, also called the discriminant. 

The Transversality Theorem asserts that the discriminant does not meet the set of poly- 
nomials with distinct roots. 

More generally, if we consider a given family of intersections of the form (111.11) . then the 
discriminant is the set of points (si, S2, . . . , s„) for which this intersection is not transverse. 
Again, this discriminant is a polynomial A(si, S2, . . . , s„) in the parameters (si, S2, . . . , Sn) G 
(P^)", and the Transversality Conjecture asserts that A does not vanish when the parameters 
are real and distinct, that is, when Si 7^ sj, for all We conjecture something much 
stronger. 

Conjecture 11.3 (Discriminant Conjecture) The discriminant polynomial A is a sum 
of squares. Each term in the sum is a monomial in the differences (sj — Sj). 

There is some evidence for this conjecture. For the problem of two lines in 3-space 
meeting four lines i{s), i(t), i{u), and i{v), each tangent to the rational normal curve, the 
discriminant is a constant multiple of 

{s-ty{u-vy + {s-uYit-vf + {s-vy{t-u)\ 

For Gr(2,5), the discriminant of the Schubert intersection 

X^F.iO) n X^F.is) n X^F.it) n X^F.iu) n X^F.iv) n XaF.(oo) , 

has degree 20 in the variables s,t,u, and v, and it has 711 different terms. Ad hoc meth- 
ods |110j showed that it was a sum of squares.^ This was quite surprising. Hilbert [50] 
showed that not every polynomial of even degree greater than 2 in four variables that is 
nonnegative can be written as a sum of squares. 

We feel that it is a useful project to further investigate these discriminant polynomials. 

11.2 Maximally inflected curves 

This section represents joint work with Kharlamov [62]. A list fi{t), /2(t), . . . , fk{t) of poly- 
nomials with degree d defines a map : — t- as follows, 

; pi3t ^ ^f^^t):f,it):---:f,it)]e¥'-K 

The image of the curve is convex at a point (f(t) if and only if the first k—1 derivatives of 
(f{t) are linearly independent. The failure to be convex is measured exactly by the vanishing 
of the Wronskian of the polynomials fi(t), f2(t), . . . , fk(t). We say that ip is raniiGed at a 
point t if this Wronskian vanishes at t. Another geometric term is that the curve ip has an 
inflection point (or fiex) at t. This corresponds exactly to the usual notion of an infiection 

^^Give more details? 
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point for plane curves. A rational curve of degree d in P^"^ has k[d+l—k) flexes, counted 
with multiplicity. 

The connection between the Schubert calculus and rational curves in projective space 
(linear series on P^) originated in work of Castelnuovo [19] on (7-nodal rational curves. This 
led to the use of Schubert calculus in Brill-Noether theory (see Chapter 5 of [19] for an 
elaboration). In turn, the theory of limit linear series of Eisenbud and Harris [30l [31] 
provides essential tools to show reality of the special Schubert calculus }108j . (That result 
on the special Schubert calculus was a generalization of Theorem 18.21 of Chapter [HI) 

Mukhin, Tarasov, Varchenko Theorem for rational curves. // a rational curve in 
P^^^ has all of its flexes real, then it must he real. 

Definition 11.4 A real rational curve with all of its flexes real is said to be maximally 
inflected. 

The Shapiro conjecture asserts that there are lots (maximally many, in fact) of these 
maximally inflected curves. 

Up to projective transformation and reparameterization, there are only three real rational 
plane cubic curves. They are represented by the equations 

2 3,2 2 32 i 2 3 

y = x+x, y = X — X , and y = x , 
and they have the shapes shown in Figure 111.11 All three have a real flex at infinity and are 



Figure 11.1: Real rational cubics. 

singular at the origin. The first has a real node and no other real flexes, the second has a 
solitary point and two real flexes at (|, ±^:^) (we indicate these with dots and the complex 
conjugate tangents at the solitary point with dashed lines), and the third has a real cusp. 
The last two are maximally inflected, while the first is not. 

The Schubert calculus gives 5 rational quartics with 6 given points of inflection and 
Figure [11.21 shows 5 maximally inflected curves with flexes at {— 3, — 1, 0, 1, 3, 00}. (Each 
nodal curve has 2 flexes at its node, which is a consequence of the symmetry in the choice 
of flexes.) We indicate the differences in the parameterizations of these curves, labeling the 
flex at —3 by the larger dot and the flex at —1 by the circle. The solitary points are not 
drawn. The first three curves have two solitary points, while the last 2 have three solitary 
points. 
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Figure 11.2: The 5 curves with flexes at {—3, —1, 0, 1, 3, oo}. 

Here are a few quintics. The flexes are indicated (the symmetric curves have one addi- 
tional flex at infinity), but we do not draw solitary points. Also, the open circles represent 
two flexes which have merged into a planar point. 




Finally, here are four singular quintics. The solitary point is drawn on the second. 




None of these maximally inflected curves had many nodes. Indeed, rational quartics 
typically have 3 double points and quintics have 6, yet we saw quartics with at most 1 node 
and quintics with at most 3 nodes. More generally, consider a maximally inflected curve 
with only flexes and cusps, and whose other singularities are ordinary double points. Let l 
be its number of flexes and k be its number of cusps. Then l + 2k = 3{d — 2). By the genus 
formula, it has ('^2^) — k double points. The following theorem is an easy consequence of 
the Klein [BB] and Pliicker [HH] formulas. 
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Theorem 11.5 (Topological Restrictions [62]) Suppose that a maximally inflected curve 
has only flexes, cusps, and ordinary double points, and let l and k be its number of flexes 
and cusps, respectively. Then it has at least d — 2 — k solitary points and at most (''2^) 
nodes. 

Thus maximally inflected cubics have at most (^2^) = nodes, quartics have at most 
("^2^) = 1 node, and quintics have at most (^2^) = 3 nodes. 

The existence of curves satisfying the hypotheses of Theorem 1 11 . 51 is not guaranteed, even 
if we know the Shapiro conjecture. For example, the construction in Theorem 4 of Chapter [8] 
does not guarantee that the maximally inflected curve has only ordinary double points. 
There are, however two constructions which guarantee curves having only double points. 
The flrst uses Shustin's patchworking of singular curves |101] to obtain degenerate Harnack 
curves with ("'2^) solitary points (the flgure on the left illustrates this patchworking). The 
second perturbs d — 2 lines tangent to a conic to obtain maximally inflected curves of degree 
d with the minimal number of solitary points (and up to c? — 2 cusps). 




The topological classiflcation of maximally inflected plane quintics is open.''^ Also, maxi- 
mally inflected curves in higher-dimensional space have not been investigated. For example, 
which knot types can occur for maximally inflected space curves? We speculate that there 
should be some restrictions on the number of real quadrisecants of a maximally inflected 
space curve. 

11.3 The Shapiro Conjecture for flag manifolds 

The Shapiro conjecture involved linear spaces satisfying incidence conditions imposed by 
flags osculating the rational normal curve at real points. One natural variant is to consider 
flags satisfying incidence conditions imposed by osculating flags. 

Example 11.6 Consider partial flags of subspaces m G H, where m is a line which lies on 
a plane if in P"^. Here are two typical incidence conditions on these flags. 

• m meets a flxed line i{t) tangent to the rational normal curve 7 at the point 7(t). 
^Explain a couple of open questions 
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• H contains a fixed point 7(t) of tlie rational normal curve. 

We consider the problem of partial flags m G H where m meets three tangent lines 
^(0), and to 7, and H contains two points j{v) and 7(w). First observe that 
H contains the line X{v,w) spanned by these two points. But then m must also meet this 
secant line X{v,w), a.s m G H. In this way, we see that this problem reduces to our old 
friend, the problem of four lines. Now, however, we seek lines m which meet three tangent 
lines -^(—1), ^(0), and to the rational normal curve, and one secant line X{v,w). Given 
such a line m, we may recover H as the linear span of m and X{v, w). 

As in Section 4 of Chapter [3, we begin with the quadric Q containing the three lines 
£(— 1), £(0), and tangent to the rational normal curve 7. The lines meeting £{ — 1), f(0). 




Figure 11.3: Quadric containing three lines tangent to the rational normal curve. 

£(1), and the secant line X{v,w) correspond to the points where X{v,w) meets the quadric 
Q. In Figure lll.4[ we display a secant line X{v, w) which meets the hyperboloid in two 
real points, and therefore these choices for v and w give two real solutions to our geometric 
problem. There is also a secant line which meets the hyperboloid Q in no real points, and 
hence in two complex conjugate points. For this secant line, neither flag solving our problem 
is real. We show this configuration in Figure 111.51 

The original Shapiro conjecture concerned partial flags of a given type meeting flags 
osculating the rational normal curve (at real points), and it asserted that all such partial 
flags would be real. This example shows that the Shapiro conjecture fails for flags, but not 
too badly. It is useful to consider this failure schematically. In Figure [Tl. 31 we represent the 
rational normal curve 7 as a circle, and indicate the relative positions of three tangent lines 
and the secant line. Observe that in the diagram on the left, one may travel along the circle, 
first encountering the three points where the lines are tangent and then the two points where 
the line is secant. If we record the dimension of the piece of the flag m G H which is affected 
by the tangent line or point on the rational normal curve as we travel along the circle, we 
get the sequence (1, 1, 1, 2, 2), which is weakly increasing. The reading the diagram on the 
right either gives the sequence (1, 1, 2, 1, 2), or its reverse (2, 1, 2, 1, 1), or (1, 2, 1, 2, 1), none 
of which are monotone. 
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£(1) 



£(-1) 



A('L', w) 




m 



Figure 11.4: A secant line meeting Q. 



11.3.1 The Monotone Conjecture 

A partial Bag E, of type a = (oi, a2, . . . , a^) is a sequence of linear subspaces 



E, : Ea^ C Ea2 C ■ ■ ■ C Ea^^ C C" , 



where = dim Ei. Its possible positions with respect to a fixed (complete) flag are encoded 
by certain permutations, a. The original Shapiro conjecture concerned partial flags of a given 
type meeting flags osculating the rational normal curve (at real points), and it asserted that 
all such partial flags would be real. The example that we just gave shows that it was wrong'''. 
However, it can be repaired. 

A position a for flags E, of type a which may be expressed in terms of only one piece 
Ea^ of the flag E, is called Grassmannian of index ai. When this occurs, write L{a) = ai. 
In the counterexample to the Shapiro conjecture, both conditions are Grassmannian — one 
has index 2 (for the requirement that m meet a tangent line) and the other has index 3 (for 
the requirement that H meet a point on the rational normal curve.) These differ from the 
dimension of the linear spaces m and H, as we were working in projective 3-space, rather 
than a 4-dimensional vector space. 

Conjecture 11.7 (Monotone Conjecture) Let ai, ^2, . . . , a-m be Grassmannian condi- 
tions on flags of type a whose indices satisfy i{ai) < ^(02) < ■■■ < ^(am)- Then if 
Si < 82 < ■ ■ ■ < Sm are real numbers, the intersection 



m 



i=l 



is transverse with all points real. 



Hn fact, it fails for the first nontrivial problem on a flag manifold. 
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Figure 11.6: Always real. Not always real. 

As with the Shapiro conjecture for Grassmannians, it is known that transversality will 
imply that the points of intersection are real. There is also a similar discriminant conjecture. 
The prephme generated by polynomials gi,g2, ■ ■ ■ ,gm is the collection of polynomials / of 
the form 

/ = (To + aigi + (T25'2 + ■■■ + (Jmgm, 

where the polynomials a are sums of squares. Any polynomial / in this preprime is positive 
on the set 

K := {x\ g,{x) > 0} 

and a representation of / as an element of this preprime is a certificate of its positivity on 
K. 

Conjecture 11.8 (Discriminant Conjecture) The discriminant of the polynomial sys- 
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tem associated to an intersection of Schubert varieties defined by Grassmannian conditions 
as in the monotone conjecture lies in the preprime generated by the differences (sj — sj) for 
i > j. 

The point of this conjecture is that not only is the discriminant positive on the set 
{(si, S2, ■ ■ ■ , Sm) \ Si < S2 < ■ ■ ■ < s^} of monotonc parameters, but that it has a special 
form from which this positivity is transparent. It is known that not all polynomials which 
are positive on a set of this form can lie in the preprime generated by the differences Si — Sj 
for i > j. 

There is a continually unfolding story of the Shapiro conjecture and its generalizations, 
both to secant flags (by Eremenko, Gabrielov, Shapiro, and Vainshtein |35]) and for other 
flag manifolds, for other groups. Very few computations have been done (but see [llTj for 
the beginnings for the Lagrangian Grassmannian and orthogonal Grassmannian.) 

We end these notes with the remark that, just as they were being completed, Mukhin, 
Tarasov, and Varchenko [81] gave a proof the the Shapiro conjecture for Grassmannians, 
using the Bethe Ansatz and the Gaudin model (a connection between differential equations 
with polynomial solutions, representation theory of the special linear group, and intersections 
of Schubert varieties given by flags osculating the rational normal curve) . The discriminant 
conjecture remain open, however. 

Topics to work on for next rewrite: 

1. Discuss some of the evidence for the Monotone Conjecture 

2. Explain how the result of Eremenko, Gabrielov, Shapiro, and Vainshtein (Section 9.2) 
establishes the monotone conjecture for certain flag manifolds. 

3. Pose the Secant Conjecture and discuss its the evidence for it. Include gaps, and relate 
to Chapter 6. 

4. Pose the Monotone secant conjecture and give a few words about current computations. 

5. Explain (briefly) the Shapiro conjecture for Lagrangian and Orthogonal Grassmanni- 
ans, including Purbhoo's proof for Orthogonal Grassmannians. 
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